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NOVEL TELOMERASE 

The present application is a Continuation-In-Part application of U.S. Patent 

Appln. Ser. No. 08/ , filed 04/18/97, which is a Continuation-in-Part of U.S. 

Patent Appln. Ser. No. 08/724,643, filed on October 1, 1996. 

5 FIELD OF THE INVENTION 

The present invention is related to novel telomerase genes and proteins. In 
particular, the present invention is directed to a telomerase isolated from Euplotes 
aediculatus, the two polypeptide subunits of this telomerase, as well as sequences of 
the human homolog of the E. aediculatus telomerase. 

10 BACKGROUND OF THE INVENTION 

Telomeres, the protein-DNA structures physically located on the ends of the 
eukaryotic organisms, are required for chromosome stability and are involved in 
chromosomal organization within the nucleus (See e.g., Zakian, Science 270:1601 
[1995]; Blackburn and Gall, J. Mol. Biol, 120:33 [1978]; Oka et al, Gene 10:301 

15 [1980]; and Klobutcher et al, Proc. Natl. Acad. Sci., 78:3015 [1981]). Telomeres are 

believed to be essential in such organisms as yeasts and probably most other 
eukaryotes, as they allow cells to distinguish intact from broken chromosomes, protect 
chromosomes from degradation, and act as substrates for novel replication 
mechanisms. Telomeres are generally replicated in a complex, cell cycle and 

20 developmentally regulated, manner by "telomerase," a telomere-specific DNA 

polymerase. However, telomerase-independent means for telomere maintenance have 
been described. In recent years, much attention has been focused on telomeres, as 
telomere loss has been associated with chromosomal changes such as those that occur 
in cancer and aging. 
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Telomeric DNA 

In most organisms, telomeric DNA has been reported to consist of a tandem 
array of very simple sequences, which in many cases are short and precise. Typically, 
telomeres consist of simple repetitive sequences rich in G residues in the strand that 
5 runs 5' to 3' toward the chromosomal end. For example, telomeric DNA in 

Tetrahymena is comprised of sequence T2G4, while in Oxytricha, the sequence is T4G4, 
and in humans the sequence is T2AG3 {See e.g., Zakian, Science 270:1601 [1995]; and 
Lingner et al, Genes Develop., 8:1984 [1994]). However, heterogenous telomeric 
sequences have been reported in some organisms {e.g., the sequence TG1.3 in 

10 Saccharomyces). In addition, the repeated telomeric sequence in some organisms is 

much longer, such as the 25 base pair sequence of Kluyveromyces lactis. Moreover, 
the telomeric structure of some organisms is completely different. For example, the 
telomeres of Drosophila are comprised of a transposable element {See, Biessman et al, 
Cell 61:663 [1990]; and F.-m Sheen and Levis, Proc. Natl. Acad. Sci., 91:12510 

15 [1994]). 

The telomeric DNA sequences of many organisms have been determined {See 
e.g, Zakian, Science 270:1601 [1995]). However, it has been noted that as more 
telomeric sequences become known, it is becoming increasingly difficult to identify 
even a loose consensus sequence to describe them (Zakian, supra). Furthermore, it is 

20 known that the average amount of telomeric DNA varies between organisms. For 

example, mice may have as many as 150 kb (kilobases) of telomeric DNA per 
telomere, while the telomeres of Oxytricha macronuclear DNA molecules are only 20 
bp in length (Kipling and Cooke, Nature 347:400 [1990]; Starling et al, Nucleic Acids 
Res., 18:6881 [1990]; and Klobutcher et al, Proc. Natl. Acad. Sci., 78:3015 [1981]). 

25 Moreover, in most organisms, the amount of telomeric DNA fluctuates. For example, 

the amount of telomeric DNA at individual yeast telomeres in a wild-type strain may 
range from approximately 200 to 400 bp, with this amount of DNA increasing and 
decreasing stoichastically (Shampay and Blackburn, Proc. Natl. Acad. Sci., 85:534 
[1988]). Heterogeneity and spontaneous changes in telomere length may reflect a 
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complex balance between the processes involved in degradation and lengthening of 
telomeric tracts. In addition, genetic, nutritional and other factors may cause increases 
or decreases in telomeric length (Lustig and Petes, Natl. Acad. Sci., 83:1398 [1986]; 
and Sandell et al, Cell 91:12061 [1994]). The inherent heterogeneity of virtually all 
5 telomeric DNAs suggests that telomeres are not maintained via conventional replicative 

processes. 

In addition to the telomeres themselves, the regions located adjacent to 
telomeres have been studied. For example, in most organisms, the sub-telomeric 
regions immediately internal to the simple repeats consist of middle repetitive 
10 sequences, designated as telomere-associated ("TA") DNA. These regions bear some 

similarity with the transposon telomeres of Drosophila. In Saccharomyces, two classes 
of TA elements, designated as "X" and "Y,"' have been described (Chan and Tye, Cell 
33:563 [1983]). These elements may be found alone or in combination on most or all 
telomeres. 

15 Telomeric Structural Proteins 

Various structural proteins that interact with telomeric DNA have been 
described which are distinct from the protein components of the telomerase enzyme. 
Such structural proteins comprise the "telosome" of Saccharomyces chromosomes 
(Wright et al, Genes Develop., 6:197 [1992]) and of ciliate macronuclear DNA 

20 molecules (Gottschling and Cech, Cell 38:501 [1984]; and Blackburn and Chiou, Proc. 

Natl. Acad. Sci., 78:2263 [1981]). The telosome is a non-nucleosomal, but discrete 
chromatin structure that encompasses the entire terminal array of telomeric repeats. In 
Saccharomyces, the DNA adjacent to the telosome is packaged into nucleosomes. 
However, these nucleosomes are reported to differ from those in most other regions of 

25 the yeast genome, as they have features that are characteristic of transcriptionally 

inactive chromatin (Wright et al, Genes Develop., 6:197 [1992]; and Braunstein et al, 
Genes Develop., 7:592 [1993]). In mammals, most of the simple repeated telomeric 
DNA is packaged in closely spaced nucleosomes (Makarov et al. Cell 73:775 [1993]; 
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and Tommerup et al, Mol. Cell. Biol., 14:5777 [1994]). However, the telomeric 
repeats located at the very ends of the human chromosomes are found in a telosome- 
like structure. 

Telomere Replication 

5 Complete replication of the ends of linear eukaryotic chromosomes presents 

special problems for conventional methods of DNA replication. For example, 
conventional DNA polymerases cannot begin DNA synthesis de novo, rather, they 
require RNA primers which are later removed during replication. In the case of 
telomeres, removal of the RNA primer from the lagging-strand end would necessarily 

10 leave a 5 '-terminal gap, resulting in the loss of sequence if the parental telomere was 

blunt-ended (Watson, Nature New Biol., 239:197 [1972]; Olovnikov, J. Theor. Biol., 
41:181 [1973]). However, the described telomeres have 3' overhangs (Klobutcher et 
al, Proc. Natl. Acad. Sci., 58:3015 [1981]; Henderson and Blackburn, Mol. Cell. 
Biol., 9:345 [1989]; and Wellinger et al, Cell 72:51 [1993]). For these molecules, it 

15 is possible that removal of the lagging-strand 5 '-terminal RNA primer could regenerate 

the 3' overhang without loss of sequence on this side of the molecule. However, loss 
of sequence information on the leading-strand end would occur, because of the lack of 
a complementary strand to act as template in the synthesis of a 3' overhang (Zahler 
and Prescott, Nucleic Acids Res., 16:6953 [1988]; Lingner et al, Science 269:1533 

20 [1995]). 

Nonetheless, complete replication of the chromosomes must occur. While 
conventional DNA polymerases cannot accurately reproduce chromosomal DNA ends, 
specialized factors exist to ensure their complete replication. Telomerase is a key 
component in this process. Telomerase is a ribonucleoprotein (RNP) particle and 
25 polymerase that uses a portion of its internal RNA moiety as a template for telomere 

repeat DNA synthesis (Yu et al, Nature 344:126 [1990]; Singer and Gottschling, 
Science 266:404 [1994]; Autexier and Greider, Genes Develop., 8:563 [1994]; Gilley 
et al. Genes Develop., 9:2214 [1995]; McEachem and Blackburn, Nature 367:403 
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[1995]; Blackburn, Arm. Rev. Biochem., 61:113 [1992];. Greider, Ann. Rev. 
Biochem., 65:337 [1996]). The activity of this enzyme depends upon both its RNA 
and protein components to circumvent the problems presented by end replication by 
using RNA {i.e., as opposed to DNA) to template the synthesis of telomeric DNA. 
5 Telomerases extend the G strand of telomeric DNA. A combination of factors, 

including telomerase processivity, frequency of action at individual telomeres, and the 
rate of degradation of telomeric DNA, contribute to the size of the telomeres {i.e., 
whether they are lengthened, shortened, or maintained at a certain size). In vitro, 
telomerases may be extremely processive, with the Tetrahymena telomerase adding an 

10 average of approximately 500 bases to the G strand primer before dissociation of the 

enzyme (Greider, Mol. Cell. Biol., 114572 [1991]). 

Importantly, telomere replication is regulated both by developmental and cell 
cycle factors. It has been hypothesized that aspects of telomere replication may act as 
signals in the cell cycle. For example, certain DNA structures or DNA-protein 

15 complex formations may act as a checkpoint to indicate that chromosomal replication 

has been completed {See e.g., Wellinger et al, Mol. Cell. Biol., 13:4057 [1993]). In 
addition, it has been observed that in humans, telomerase activity is not detectable in 
most somatic tissues, although it is detected in many tumors (Wellinger, supra). This 
telomere length may serve as a mitotic clock, which serves to limit the replication 

20 potential of cells in vivo and/or in vitro. What remains needed in the art is a method 

to study the role of telomeres and their replication in normal as well as abnormal cells 
{i.e., cancerous cells). An understanding of telomerase and its function is needed in 
order to develop means for use of telomerase as a target for cancer therapy or anti- 
aging processes. 



25 SUMMARY OF THE INVENTION 

The present invention provides compositions and methods for purification and 
use of telomerase. In particular, the present invention is directed to telomerase and co- 
purifying polypeptides obtained from Euplotes aediculatus. 
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The present invention provides heretofore imknown telomerase subunit proteins 
of approximately 123 kDa and 43 kDa, as measiired on SDS-PAGE. In particular, the 
present invention provides substantially purified 123 kDa and 43 kDa telomerase 
protein subunits. 

5 One aspect of the invention features isolated and substantially purified 

polynucleotides which encode telomerase subunits (i.e., the 123 kDa and 43 kDa 
protein subunits). In a particular aspect, the polynucleotide is the nucleotide sequence 
of SEQ ID NO: 1, or variants thereof In an alternative embodiment, the present 
invention provides fragments of the isolated (i.e., substantially purified) polynucleotide 

10 encoding the telomerase 123 kDa subunit of at least 10 amino acid residues in length. 

The invention further contemplates fragments of this polynucleotide sequence (i.e., 
SEQ ID N0:1) that are at least 6 nucleotides, at least 25 nucleotides, at least 30 
nucleotides, at least 100 nucleotides, at least 250 nucleotides, and at least 500 
nucleotides in length. In addition, the invention features polynucleotide sequences that 

15 hybridize under stringent conditions to SEQ ID NO: 1, or fragments thereof The 

present invention further contemplates a polynucleotide sequence comprising the 
complement of the nucleic acid of SEQ ID NO: 1, or variants thereof 

The present invention also provides the polynucleotide with the sequence of 
SEQ ID N0:3. In particular, the present invention provides the polynucleotide 

20 sequence comprising at least a portion of the nucleic acid sequence of SEQ ID N0:3, 

or variants, thereof In one embodiment, the present invention provides fragments of 
the isolated (i.e., substantially purified) polynucleotide encoding the telomerase 43 kDa 
subimit of at least 10 amino acid residues in length. The invention also provides an 
isolated polynucleotide sequence encoding the polypeptide of SEQ ID NOS:4-6, or 

25 variants thereof The invention further contemplates fragments of this polynucleotide 

sequence (i.e., SEQ ID N0:3) that are at least 5 nucleotides, at least 20 nucleotides, at 
least 100 nucleotides, at least 250 nucleotides, and at least 500 nucleotides in length. 
In addition, the invention features polynucleotide sequences that hybridize under 
stringent conditions to SEQ ID NO: 3, or fragments thereof The present invention 
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further contemplates a polynucleotide sequence comprising the complement of the 
nucleic acid of SEQ ID N0:3, or variants thereof. 

The present invention provides a substantially purified polypeptide comprising 
at least a portion of the amino acid sequence of SEQ ID N0:2, or variants thereof. In 
5 one embodiment, the portion of the polypeptide sequence comprises fragments of SEQ 

ID N0:2, having a length greater than 10 amino acids. However, the invention also 
contemplates polypeptide sequences of various lengths, the sequences of which are 
included within SEQ ID N0:2, ranging from 5-500 amino acids. The present 
invention also provides an isolated polynucleotide sequence encoding the polypeptide 

10 of SEQ ID N0:2, or variants, thereof 

The present invention provides a substantially purified polypeptide comprising 
at least a portion of the amino acid sequence selected from the group consisting of 
SEQ ID NO:4-6, or variants thereof In one embodiment, the portion of the 
polypeptide comprises fragments of SEQ ID N0:4, having a length greater than 10 

15 amino acids. In an alternative embodiment, the portion of the polypeptide comprises 

fragments of SEQ ID N0:5, having a length greater than 10 amino acids. In yet 
another alternative embodiment, the portion of the polypeptide comprises fragments of 
SEQ ID N0:6, having a length greater than 10 amino acids. The present invention 
also contemplates polypeptide sequences of various lengths, the sequences of which are 

20 included within SEQ ID N0S:4, 5, and/or 6, ranging from 5 to 500 amino acids. 

The present invention also provides a telomerase complex comprised of at least 
one purified 123 kDa telomerase protein subunit, at least one a purified 43 kDa 
telomerase protein subunit, and purified RNA. In a preferred embodiment, the 
telomerase complex comprises one purified 123 kDa telomerase protein subunit, one 

25 purified 43 kDa telomerase protein subunit, and purified telomerase RNA. In one 

preferred embodiment, the telomerase complex comprises an 123 kDa and/or 
telomerase protein subunit obtained from Euplotes aediculatus. It is contemplated that 
the 123 kDa telomerase protein subunit of the telomerase complex be encoded by SEQ 
ID N0:1. It is also contemplated that the 123 kDa telomerase protein subunit of the 

30 telomerase complex be comprised of SEQ ID N0:2. It is also contemplated that the 

- 7 - 




43 kDa telomerase protein subunit of the telomerase complex be obtained from 
Euplotes aediculatus. It is further contemplated that the 43 kDa telomerase subunit of 
the telomerase complex be encoded by SEQ ID N0:3. It is also contemplated that the 
43 kDa telomerase protein subunit of the telomerase complex be comprised of the 
amino acid sequence selected from the group consisting of SEQ ID N0:4, SEQ ID 
N0:5, and SEQ ID N0:6. It is contemplated that the purified RNA of the telomerase 
complex be comprised of the RNA encoded by such sequences as those disclosed by 
Linger et al, (Lingner et al. Genes Develop., 8:1985 [1994]). In a preferred 
embodiment, the telomerase complex is capable of replicating telomeric DNA. 

The present invention also provides methods for identifying telomerase protein 
subunits in eukaryotic organisms other than E. aediculatus. These methods are 
comprised of multiple steps. The first step is the synthesis of at least one probe or 
primer oligonucleotide that encodes at least a portion of the amino acid sequence of 
SEQ ID N0S:2, 4, 5, or 6. In the alternative, the synthesized probe or primer 
oligonucleotides are complementary to at least a portion of the amino acid sequence of 
SEQ ID N0:2, 4, 5, or 6. The next step comprises exposing at least one of the probe 
or primer oligonucleotide(s) to nucleic acid comprising the genome or, in the 
alternative, the expressed portion of the genome of the other organism {i.e., the non-£:. 
aediculatus organism), under conditions suitable for the formation of nucleic acid 
hybrids. Next, the hybrids are identified with or without amplification, using a DNA 
polymerase {e.g., Taq, or any other suitable polymerase known in the art). Finally, the 
sequence of the hybrids are determined using methods known in the art, and the 
sequences of the derived amino acid sequences analyzed for their similarity to SEQ ID 
N0S:2, 4, 5, or 6. 

The present invention also provides methods for identifying nucleic acid 
sequences encoding telomerase protein subunits in eukaryotic organisms comprising the 
steps of: providing a sample suspected of containing nucleic acid encoding an 
eukaryotic telomerase protein subunit; at least one oligonucleotide primer 
complementary to the nucleic acid sequence encoding at least a region of an Euplotes 
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aediculatus telomerase protein subunit; and iii) a polymerase; exposing the sample to 
the at least one oligonucleotide primer and the polymerase under conditions such that 
the nucleic acid encoding the eukaryotic telomerase protein subunit is amplified; 
determining the sequence of the eukaryotic telomerase protein subunit; and 
comparing the sequence of the eukaryotic telomerase protein subunit and the Euplotes 
aediculatus telomerase protein subunit. In one preferred embodiment, the Euplotes 
aediculatus telomerase subunit comprises at least a portion of SEQ ID N0:1. In an 
alternative preferred embodiment, the Euplotes aediculatus telomerase subunit 
comprises at least a portion of SEQ ID N0:3. 

Thus, the present invention also provides methods for identification of 
telomerase protein subunits in eukaryotic organisms other than E. aediculatus. In 
addition, the present invention provides methods for comparisons between the amino 
acid sequences of SEQ ID N0S:2, 4, 5, or 6, and the amino acid sequences derived 
from gene sequences of other organisms or obtained by direct amino acid sequence 
analysis of protein. The amino acid sequences shown to have the greatest degree of 
identity {i.e., homology) to SEQ ID N0S:2, 4, 5, or 6, may then selected for further 
testing. Sequences of particular importance are those that share identity with the 
reverse transcriptase motif of the Euplotes sequence. Once identified, the proteins with 
the sequences showing the greatest degree of identity may be tested for their role in 
telomerase activity by genetic or biochemical methods, including the methods set forth 
in the Examples below. 

The present invention also provides methods for purification of telomerase 
comprising the steps of providing a sample containing telomerase, an affinity 
oligonucleotide, a displacement oligonucleotide; exposing the sample to the affinity 
oligonucleotide under conditions wherein the affinity oligonucleotide binds to the 
telomerase to form a telomerase-oligonucleotide complex; and exposing the 
oligonucleotide-telomerase complex to the displacement oligonucleotide under 
conditions such that the telomerase is released from the template. In a preferred 
embodiment, the method comprises the further step of eluting the telomerase. In 
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another preferred embodiment, the affinity oHgonucleotide comprises an antisense 
portion and a biotin residue. It is contemplated that during the exposing step, the 
biotin residue of the affinity oligonucleotide binds to an avidin bead and the antisense 
portion binds to the telomerase. It is also contemplated that during the exposing step, 
the displacement oligonucleotide binds to the affinity oligonucleotide. 

The present invention further provides substantially purified polypeptides 
comprising the amino acid sequence comprising SEQ ID N0S:61, 63, 64, 65, 67, and 
68. In another embodiment, the present invention also provides purified, isolated 
polynucleotide sequences encoding the polypeptides comprising the amino acid 
sequences of SEQ ID N0S:61, 63, 64, 65, 67, and 68. The present invention 
contemplates portions or fragments of SEQ ID N0S:61, 63, 64, 65, 67, and 68, of 
various lengths. In one embodiment, the portion of polypeptide comprises fragments 
of lengths greater than 10 amino acids. However, the present invention also 
contemplates polypeptide sequences of various lengths, the sequences of which are 
included within SEQ ID N0S:61, 63, 64, 65, 67, and 68, ranging firom 5 to 500 amino 
acids (as appropriate, based on the length of SEQ ID N0S:61, 63, 64, 65, 67, and 68). 

The present invention also provides nucleic acid sequences comprising SEQ ID 
NOS:55, 62, 66, and 69, or variants thereof The present invention further provides 
fragments of the isolated polynucleotide sequences that are at least 6 nucleotides, at 
least 25 nucleotides, at least 30 nucleotides, at least 50 nucleotides, at least 100 
nucleotides, at least 250 nucleotides, and at least 500 nucleotides in length (as 
appropriate for the length of the sequence of SEQ ID NOS:55, 62, 66, and 69, or 
variants thereof). 

In particularly preferred embodiments, the polynucleotide hybridizes 
specifically to telomerase sequences, wherein the telomerase sequences are selected 
from the group consisting of human, Euplotes aediculatus, Oxytricha, 
Schizosaccharomyces, and Saccharomyces telomerase sequences. In other preferred 
embodiments, the present invention provides polynucleotide sequences comprising the 
complement of nucleic acid sequences selected fi-om the group consisting of SEQ ID 
NOS:55, 62, 66, and 69, or variants thereof In yet other preferred embodiments, the 
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present invention provides polynucleic acid sequences that hybridize under stringent 
conditions to at least one nucleic acid sequence selected from the group consisting of 
SEQ ID NO:55, 62, 66, and 69. In a further embodiment, the polynucleotide sequence 
comprises a purified, synthetic nucleotide sequence having a length of about ten to 
thirty nucleotides. 

The present invention also provides methods for detecting the presence of 
nucleotide sequences encoding at least a portion of human telomerase in a biological 
sample, comprising the steps of, providing: a biological sample suspected of containing 
nucleic acid corresponding to the nucleotide sequence of SEQ ID NO:62; the 
nucleotide of SEQ ID NO:62, or a fragment thereof; combining the biological sample 
with the nucleotide under conditions such that a hybridization complex is formed 
between the nucleic acid and the nucleotide; and detecting the hybridization complex. 

In one embodiment of the method the nucleic acid corresponding to the 
nucleotide sequence of SEQ ID NO:62 is ribonucleic acid, while in an alternative 
embodiment, the nucleotide sequence is deoxyribonucleic acid. In yet another 
embodiment of the method the detected hybridization complex correlates with 
expression of the polynucleotide of SEQ ID NO:62 in the biological sample. In yet 
another embodiment of the method, detection of the hybridization complex comprises 
conditions that permit the detection of alterations in the polynucleotide of SEQ ID 
NO:62 in the biological sample. 

The present invention also provides antisense molecules comprising the nucleic 
acid sequence complementary to at least a portion of the polynucleotide of SEQ ID 
NO:55, 62, 66, 67, and 69. In an alternatively preferred embodiment, the present 
invention also provides pharmaceutical compositions comprising antisense molecules of 
SEQ ID NOS:55, 62, 67, and 69, and a pharmaceutically acceptable excipient and/or 
other compound (e.g., adjuvant). 

In yet another embodiment, the present invention provides polynucleotide 
sequences contained on recombinant expression vectors. In one embodiment, the 
expression vector containing the polynucleotide sequence is contained within a host 
cell. 



The present invention also provides methods for producing polypeptides 
comprising the amino acid sequence of SEQ ID N0S:61, 63, 65, 67, or 69, the method 
comprising the steps of: culturing a host cell under conditions suitable for the 
expression of the polypeptide; and recovering the polypeptide from the host cell 
culture. 

The present invention also provides purified antibodies that binds specifically to 
a polypeptide comprising at least a portion of the amino acid sequence of SEQ ID 
NOS:55, 61, 63, 64, 65, 67, and/or 68. In one embodiment, the present invention 
provides a pharmaceutical composition comprising at least one antibody, and a 
pharmaceutically acceptable excipient. 

The present invention further provides methods for the detection of human 
telomerase in a biological sample comprising the steps of: providing a biological 
sample suspected of expressing human telomerase protein; and at least one antibody 
that binds specifically to at least a portion of the amino acid sequence of SEQ ID 
NOS:55, 61, 63, 64, 65, 67, and/or 68; combining the biological sample and 
antibody(ies) under conditions such that an antibody :protein complex is formed; and 
detecting the complex wherein the presence of the complex correlates with the 
expression of the protein in the biological sample. 

The present invention further provides substantially purified peptides 
comprising the amino acid sequence selected from the group consisting of SEQ ID 
NOS:71, 73, 75, 77, 79, 82, 83, 83, 85, 86, and 101. In an alternative embodiment, 
the present invention provides purified, isolated polynucleotide sequences encoding the 
polypeptide corresponding to these sequences. In preferred embodiments, the 
polynucleotide hybridizes specifically to telomerase sequences, wherein the telomerase 
sequences are selected from the group consisting of human, Euplotes aediculatus, 
Oxytricha, Schizosaccharomyces, and Saccharomyces telomerase sequences. In yet 
another embodiment, the polynucleotide sequence comprises the complement of a 
nucleic acid sequence selected from the group consisting of SEQ ID NOS:70, 72, 74, 
76, 78, 80, 81, and 100, and variants thereof In a flirther embodiment, the 
polynucleotide sequence that hybridizes under stringent conditions to a nucleic acid 
- 12 - 



sequence selected from the group consisting of SEQ ID NOS:66, 69, 80, and 81. In 
yet another embodiment, the polynucleotide sequence is selected from the group 
consisting of SEQ ID NOS:70, 72, 74, 76, 78, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 
97, 98, 99, 102, 103, 104, 105, 106, 107, 108, 109, and 110. In an alternative 
embodiment, the nucleotide sequence comprises a purified, synthetic nucleotide 
sequence having a length of about ten to fifty nucleotides. 

The present invention also provides methods for detecting the presence of 
nucleotide sequences encoding at least a portion of human telomerase in a biological 
sample, comprising the steps of, providing: a biological sample suspected of containing 
nucleic acid corresponding to the nucleotide sequence of SEQ ID NO: 100; the 
nucleotide of SEQ ID NO: 100, or a fragment thereof; combining the biological sample 
with the nucleotide under conditions such that a hybridization complex is formed 
between the nucleic acid and the nucleotide; and detecting the hybridization complex. 

In one embodiment of the method the nucleic acid corresponding to the 
nucleotide sequence of SEQ ID NO: 100 is ribonucleic acid, while in an alternative 
embodiment, the nucleotide sequence is deoxyribonucleic acid. In yet another 
embodiment of the method the detected hybridization complex correlates with 
expression of the polynucleotide of SEQ ID NO: 100 in the biological sample. In yet 
another embodiment of the method, detection of the hybridization complex comprises 
conditions that permit the detection of alterations in the polynucleotide of SEQ ID 
NO: 100 in the biological sample. 

The present invention also provides antisense molecules comprising the nucleic 
acid sequence complementary to at least a portion of the polynucleotide of SEQ ID 
NO:82 and 100. In an alternatively preferred embodiment, the present invention also 
provides pharmaceutical compositions comprising antisense molecules of SEQ ID 
NOS:82 and 100, and a pharmaceutically acceptable excipient and/or other compound 
(e.g., adjuvant). 

In yet another embodiment, the present invention provides polynucleotide 
sequences contained on recombinant expression vectors. In one embodiment, the 
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expression vector containing the polynucleotide sequence is contained within a host 
cell. 

The present invention also provides methods for producing polypeptides 
comprising the amino acid sequence of SEQ ID NOS:82, 83, 84, 85, 86, and 101, the 
method comprising the steps of: culturing a host cell under conditions suitable for the 
expression of the polypeptide; and recovering the polypeptide from the host cell 
culture. 

The present invention also provides purified antibodies that binds specifically to 
a polypeptide comprising at least a portion of the amino acid sequence of SEQ ID 
N0S:71, 73, 75, 77, 79, 82, 83, 84, 85, 86, and/or 101. In one embodiment, the 
present invention provides a pharmaceutical composition comprising at least one 
antibody, and a pharmaceutically acceptable excipient. 

The present invention further provides methods for the detection of human 
telomerase in a biological sample comprising the steps of: providing a biological 
sample suspected of expressing human telomerase protein; and at least one antibody 
that binds specifically to at least a portion of the amino acid sequence of SEQ ID 
N0S:71, 73, 75, 77, 79, 82, 83, 84, 85, 86, 87, and/or 101; combining the biological 
sample and antibody(ies) under conditions such that an antibody :protein complex is 
formed; and detecting the complex wherein the presence of the complex correlates 
with the expression of the protein in the biological sample. 

DESCRIPTION OF THE FIGURES 

Figure 1 is a schematic diagram of the affinity purification of telomerase 
showing the binding and displacement elution steps. 

Figure 2 is a photograph of a Northern blot of telomerase preparations obtained 
during the purification protocol. 

Figure 3 shows telomerase activity through the purification protocol. 

Figure 4 is a photograph of a SDS-PAGE gel, showing the presence of an 
approximately 123 kDa polypeptide and an approximately 43 kDa doublet. 
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Figure 5 is a graph showing the sedimentation coefficient of telomerase. 

Figure 6 is a photograph of a polyacrylamide/urea gel with 36% formamide. 

Figure 7 shows the putative alignments of telomerase RNA template, with SEQ 
ID NOS:43 and 44 in Panel A, and SEQ ID NOS:45 and 46 in Panel B. 

Figure 8 is a photograph of lanes 25-30 of the gel shown in Figure 6, shown at 
a lighter exposure level. 

Figure 9 shows the DNA sequence of the gene encoding the 123 kDa 
telomerase protein subunit (SEQ ID N0:1). 

Figure 10 shows the amino acid sequence of the 123 kDa telomerase protein 
subunit (SEQ ID N0:2). 

Figure 1 1 shows the DNA sequence of the gene encoding the 43 kDa 
telomerase protein subunit (SEQ ID N0:3). 

Figure 12 shows the DNA sequence, as well as the amino acid sequences of all 
three open reading frames of the 43 kDa telomerase protein subunit (SEQ ID N0S:4- 
6). 

Figure 13 shows a sequence comparison between the 123 kDa telomerase 
protein subunit of E. aediculatus (SEQ ID N0:2) and the 80 kDa polypeptide subunit 
of T. thermophila (SEQ ID NO:52). 

Figure 14 shows a sequence comparison between the 123 kDa telomerase 
protein subunit of E. aediculatus (SEQ ID N0:2) and the 95 kDa telomerase 
polypeptide of T. thermophila (SEQ ID NO:54). 

Figure 15 shows the best-fit alignment between a portion of the "La-domain" of 
the 43 kDa telomerase protein subunit of E. aediculatus (SEQ ID N0:9) and a portion 
of the 95 kDa polypeptide subunit of T. thermophila (SEQ ID NO: 10). 

Figure 16 shows the best-fit alignment between a portion of the "La-domain" of 
the 43 kDa telomerase protein subunit of E. aediculatus (SEQ ID NO: 1 1) and a 
portion of the 80 kDa polypeptide subunit of T. thermophila (SEQ ID NO: 12). 
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Figure 17 shows the alignment and motifs of the polymerase domain of the 123 
kDa telomerase protein submit of E. aediculatus (SEQ ID N0S:13 and 18) and the 
polymerase domains of various reverse transcriptases (SEQ ID NOS:14-17, and 19-22). 

Figure 18 shows the alignment of a domain of the 43 kDa telomerase protein 
subunit (SEQ ID NO:23) with various La proteins (SEQ ID NOS:24-27). 

Figure 19 shows the nucleotide sequence encoding the T. thermophila 80 kDa 
protein subunit (SEQ ID N0:51). 

Figure 20 shows the amino acid sequence of the T. thermophila 80 kDa protein 
subunit (SEQ ID NO:52). 

Figure 21 shows the nucleotide sequence encoding the T. thermophila 95 kDa 
protein subunit (SEQ ID NO: 53). 

Figure 22 shows the amino acid sequence of the T, thermophila 95 kDa protein 
subunit (SEQ ID NO:54). 

Figure 23 shows the amino acid sequence of L8543.12 ("Est2p") (SEQ ID 
NO:55). 

Figure 24 shows the alignment of the Oxytricha PGR product (SEQ ID NO: 5 8) 
with the Euplotes sequence (SEQ ID NO:59). 

Figure 25 shows the alignment of the human telomere amino acid motifs (SEQ 
ID N0:61), with portions of the tezl sequence (SEQ ID NO:63), Est2p (SEQ ID 
NO:64), and the Euplotes pl23 (SEQ ID NO:65). 

Figure 26 shows the DNA sequence of Est2 (SEQ ID NO:66). 

Figure 27 shows the amino acid sequence of a cDNA clone (SEQ ID NO: 67) 
encoding hiunan telomerase peptide motifs. 

Figure 28 shows the DNA sequence of a cDNA clone (SEQ ID NO: 62) 
encoding human telomerase peptide motifs. 

Figure 29 shows the amino acid sequence of tezl (SEQ ID NO:68). 

Figure 30 shows the DNA sequence of tezl (SEQ ID NO:69). 

Figure 31 shows the alignment of EST2p (SEQ ID NO:83), Euplotes (SEQ ID 
NO:84), and Tetrahymena (SEQ ID NO:85) sequences, as well as consensus sequence. 
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Figure 32 shows the sequences of peptides useful for production of antibodies. 

Figure 33 is a schematic summary of the tezi "^sequencing experiments. 

Figure 34 shows two degenerate primers used in PGR to identify the S. pombe 
homolog of the E. aediculatus pi 23 sequences. 

Figure 35 shows the four major bands produced in PGR using the degenerate 
primers. 

Figure 36 shows the alignment of the M2 PGR product with E. aediculatus 
pl23, S. cerevisiae, and Oxytricha telomerase protein sequences. 

Figure 37 is a schematic showing the 3' RT PGR strategy. 

Figure 38 shows the libraries and the results of screening libraries for S. pombe 
telomerase protein sequences. 

Figure 39 shows the results obtained with the //mdll [-digested positive genomic 
clones containing S. pombe telomerase sequence. 

Figure 40 is a schematic showing the 5' RT PGR strategy. 

Figure 41 shows the alignment of RT domains from telomerase catalytic 
subunits. 

Fi gure 42 shows the alignment of three telomerase sequences. 

Figure 43 shows the disruption strategy used with the telomerase genes in 5". 

pombe. 

Figure 44 shows the experimental results confirming disruption of tezl. 
Figure 45 shows the progressive shortening of telomeres in 5*. pombe due to 
tezl disruption. 

Figure 46 shows the DNA (SEQ ID NO:69) and amino acid (SEQ ID NO:68) 
sequence of tezl, with the coding regions indicated. 

Figure 47 shows the DNA (SEQ ID NO: 100) and amino acid (SEQ ID 
NO: 101) of the ORF encoding an approximately 63 kDa telomerase protein or 
fragment thereof. 

Figure 48 shows an alignment of reverse transcriptase motifs from various 
sources. 
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DEFINITIONS 

To facilitate understanding the invention, a number of terms are defined below. 

As used herein, the term "ciliate" refers to any of the protozoans belonging to 
the phylum Ciliaphora. 

As used herein, the term "eukaryote" refers to organisms distinguishable from 
"prokaryotes." It is intended that the term encompass all organisms with cells that 
exhibit the usual characteristics of eukaryotes such as the presence of a true nucleus 
bounded by a nuclear membrane, within which lie the chromosomes, the presence of 
membrane-bound organelles, and other characteristics commonly observed in 
eukaryotic organisms. Thus, the term includes, but is not limited to such organisms as 
fungi, protozoa, and animals (e.g., humans). 

As used herein, the term "polyploid" refers to cells or organisms which contain 
more than two sets of chromosomes. 

As used herein, the term "macronucleus" refers to the larger of the two types of 
nuclei observed in the ciliates. This structure is also sometimes referred to as the 
"vegetative" nucleus. Macronuclei contain many copies of each gene and are 
transcriptionally active. 

As used herein, the term "micronucleus" refers to the smaller of the two types 
of nuclei observed in the ciliates. This structure is sometimes referred to as the 
"reproductive" nucleus, as it participates in meiosis and autogamy. Micronuclei are 
diploid and are transcriptionally inactive. 

As used herein, the term "ribonucleoprotein" refers to a complex 
macromolecule containing both RNA and protein. 

As used herein, the term "telomerase polypeptide," refers to a polypeptide 
which is at least a portion of the Euplotes telomerase structure. The term encompasses 
the 123 kDa and 43 kDa polypeptide or protein subunits of the Euplotes telomerase. It 
is also intended that the term encompass variants of these protein subunits. It is 
further intended to encompass the polypeptides encoded by SEQ ID NOS: 1 and 3. 
As molecular weight measurements may vary, depending upon the technique used, it is 
not intended that the present invention be precisely limited to the 123 kDa or 43 kDa 




molecular masses of the polypeptides encoded by SEQ ID N0S:1 and 3, as determined 
by any particular method such as SDS-PAGE. 

As used herein, the terms "telomerase" and "telomerase complex" refer to 
functional telomerase enzymes. It is intended that the terms encompass the complex of 
proteins and nucleic acids found in telomerases. For example, the terms encompass 
the 123 kDa and 43 kDa telomerase protein subunits and RNA of E. aediculatus. 

As used herein, the term "capable of replicating telomeric DNA" refers to 
functional telomerase enzymes which are capable of performing the function of 
replicating DNA located in telomeres. It is contemplated that this term encompass the 
replication of telomeres, as well as sequences and structures that are commonly found 
located in telomeric regions of chromosomes. For example, "telomeric DNA" 
includes, but is not limited to the tandem array of repeat sequences found in the 
telomeres of most organisms. 

"Nucleic acid sequence" as used herein refers to an oligonucleotide, nucleotide 
or polynucleotide, and fragments or portions thereof, and to DNA or RNA of genomic 
or synthetic origin which may be single- or double-stranded, and represent the sense or 
antisense strand. Similarly, "amino acid sequence" as used herein refers to peptide or 
protein sequence. "Peptide nucleic acid" as used herein refers to an oligomeric 
molecule in which nucleosides are joined by peptide, rather than phosphodiester, 
linkages. These small molecules, also designated anti-gene agents, stop transcript 
elongation by binding to their complementary (template) strand of nucleic acid 
(Nielsen et al, Anticancer Drug Des 8:53-63 [1993]). 

A "deletion" is defined as a change in either nucleotide or amino acid sequence 
in which one or more nucleotides or amino acid residues, respectively, are absent. 

An "insertion" or "addition" is that change in a nucleotide or amino acid 
sequence which has resulted in the addition of one or more nucleotides or amino acid 
residues, respectively, as compared to, naturally occurring sequences. 

A "substitution" results from the replacement of one or more nucleotides or 
amino acids by different nucleotides or amino acids, respectively. 
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As used herein, the term "purified" refers to the removal of contaminant(s) 
from a sample. As used herein, the term "substantially purified" refers to molecules, 
either nucleic or amino acid sequences, that are removed from their natural 
environment, isolated or separated, and are at least 60% free, preferably 75% free, and 
most preferably 90% free from other components with which they are naturally 
associated. An "isolated polynucleotide" is therefore a substantially purified 
polynucleotide. 

As used herein, the term "probe" refers to an oligonucleotide (i.e., a sequence 
of nucleotides), whether occurring naturally as in a purified restriction digest or 
produced synthetically, which is capable of hybridizing to another oligonucleotide or 
polynucleotide of interest. Probes are useful in the detection, identification and 
isolation of particular gene sequences. It is contemplated that any probe used in the 
present invention will be labelled with any "reporter molecule," so that is detectable in 
any detection system, including, but not limited to enzyme (e.g., ELISA, as well as 
enzyme-based histochemical assays), fluorescent, radioactive, and luminescent systems. 
It is further contemplated that the oligonucleotide of interest (i.e., to be detected) will 
be labelled with a reporter molecule. It is also contemplated that both the probe and 
oligonucleotide of interest will be labelled. It is not intended that the present invention 
be limited to any particular detection system or label. 

As used herein, the term "target" refers to the region of nucleic acid bounded 
by the primers used for polymerase chain reaction. Thus, the "target" is sought to be 
sorted out from other nucleic acid sequences. A "segment" is defined as a region of 
nucleic acid within the target sequence. 

"Amplification" is defined as the production of additional copies of a nucleic 
acid sequence and is generally carried out using polymerase chain reaction (PGR) or 
other technologies well known in the art (e.g., Dieffenbach and Dveksler, PCR Primer, 
a Laboratory Manual, Cold Spring Harbor Press, Plainview NY [1995]). As used 
herein, the term "polymerase chain reaction" ("PCR") refers to the method of K.B. 
MulHs (U.S. Patent Nos. 4,683,195 and 4,683,202, hereby incorporated by reference). 
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which describe a method for increasing the concentration of a segment of a target 
sequence in a mixture of genomic DNA without cloning or purification. This process 
for amplifying the target sequence consists of introducing a large excess of two 
oligonucleotide primers to the DNA mixture containing the desired target sequence, 
followed by a precise sequence of thermal cycling in the presence of a DNA 
polymerase. The two primers are complementary to their respective strands of the 
double stranded target sequence. To effect amplification, the mixture is denatured and 
the primers then annealed to their complementary sequences within the target 
molecule. Following annealing, the primers are extended with a polymerase so as to 
form a new pair of complementary strands. The steps of denaturation, primer 
annealing and polymerase extension can be repeated many times {i.e., denaturation, 
annealing and extension constitute one "cycle"; there can be numerous "cycles") to 
obtain a high concentration of an amplified segment of the desired target sequence. 
The length of the amplified segment of the desired target sequence is determined by 
the relative positions of the primers with respect to each other, and therefore, this 
length is a controllable parameter. By virtue of the repeating aspect of the process, the 
method is referred to as the "polymerase chain reaction" (hereinafter "PGR"). Because 
the desired amplified segments of the target sequence become the predominant 
sequences (in terms of concentration) in the mixture, they are said to be "PGR 
amplified". 

As used herein, the term "polymerase" refers to any polymerase suitable for use 
in the amplification of nucleic acids of interest. It is intended that the term encompass 
such DNA polymerases as Taq DNA polymerase obtained from Thermus aquaticus, 
although other polymerases, both thermostable and thermolabile are also encompassed 
by this definition. 

With PGR, it is possible to amplify a single copy of a specific target sequence 
in genomic DNA to a level detectable by several different methodologies (e.g., 
hybridization with a labeled probe; incorporation of biotinylated primers followed by 
avidin-enzyme conjugate detection; incorporation of ^^P-labeled deoxynucleotide 
triphosphates, such as dGTP or dATP, into the amplified segment). In addition to 
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genomic DNA, any oligonucleotide sequence can be amplified with the appropriate set 
of primer molecules. In particular, the amplified segments created by the PGR process 
itself are, themselves, efficient templates for subsequent PGR amplifications. 
Amplified target sequences may be used to obtain segments of DNA (e.g., genes) for 
insertion into recombinant vectors. 

As used herein, the terms "PGR product" and "amplification product" refer to 
the resultant mixture of compounds after two or more cycles of the PGR steps of 
denaturation, annealing and extension are complete. These terms encompass the case 
where there has been amplification of one or more segments of one or more target 
sequences. 

As used herein, the terms "restriction endonucleases" and "restriction enzymes" 
refer to bacterial enzymes, each of which cut double- stranded DNA at or near a 
specific nucleotide sequence. 

As used herein, the term "recombinant DNA molecule" as used herein refers to 
a DNA molecule which is comprised of segments of DNA joined together by means of 
molecular biological techniques. 

As used herein, the terms "complementary" or "complementarity" are used in 
reference to polynucleotides (i.e., a sequence of nucleotides) related by the base- 
pairing rules. For example, for the sequence "A-G-T," is complementary to the 
sequence "T-G-A." Gomplementarity may be "partial," in which only some of the 
nucleic acids' bases are matched according to the base pairing rules. Or, there may be 
"complete" or "total" complementarity between the nucleic acids. The degree of 
complementarity between nucleic acid strands has significant effects on the efficiency 
and strength of hybridization between nucleic acid strands. This is of particular 
importance in amplification reactions, as well as detection methods which depend upon 
binding between nucleic acids. 

The term "homology" refers to a degree of complementarity. There may be 
partial homology or complete homology (i.e., identity). A partially complementary 
sequence is one that at least partially inhibits a completely complementary sequence 
from hybridizing to a target nucleic acid is referred to using the functional term 
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"substantially homologous." The inhibition of hybridization of the completely 
complementary sequence to the target sequence may be examined using a hybridization 
assay (Southern or Northern blot, solution hybridization and the like) under conditions 
of low stringency. A substantially homologous sequence or probe will compete for 
5 and inhibit the binding (i.e., the hybridization) of a completely homologous to a target 

under conditions of low stringency. This is not to say that conditions of low 
stringency are such that non-specific binding is permitted; low stringency conditions 
require that the binding of two sequences to one another be a specific (i.e., selective) 
interaction. The absence of non-specific binding may be tested by the use of a second 

10 target which lacks even a partial degree of complementarity (e.g., less than about 30% 

identity); in the absence of non-specific binding the probe will not hybridize to the 
second non-complementary target. 

The art knows well that numerous equivalent conditions may be employed to 
comprise either low or high stringency conditions; factors such as the length and 

15 nature (DNA, RNA, base composition) of the probe and nature of the target (DNA, 

RNA, base composition, present in solution or immobilized, etc.) and the concentration 
of the salts and other components (e.g., the presence or absence of formamide, dextran 
sulfate, polyethylene glycol) are considered and the hybridization solution may be 
varied to generate conditions of either low or high stringency hybridization different 

20 from, but equivalent to, the above listed conditions. The term "hybridization" as used 

herein includes "any process by which a strand of nucleic acid joins with a 
complementary strand through base pairing" (Coombs, Dictionary of Biotechnology, 
Stockton Press, New York NY [1994]. 

"Stringency" typically occurs in a range from about T^-5°C (5°C below the T„ 

25 of the probe) to about 20°C to 25°C below T„. As will be understood by those of 

skill in the art, a stringent hybridization can be used to identify or detect identical 
polynucleotide sequences or to identify or detect similar or related polynucleotide 
sequences. 
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As used herein, the term "T^" is used in reference to the "melting temperature." 
The melting temperature is the temperature at which a population of double-stranded 
nucleic acid molecules becomes half dissociated into single strands. The equation for 
calculating the T^ of nucleic acids is well known in the art. As indicated by standard 
5 references, a simple estimate of the T^ value may be calculated by the equation: T^ = 

81.5 + 0.41(% G + C), when a nucleic acid is in aqueous solution at 1 M NaCl {see 
e.g., Anderson and Young, Quantitative Filter Hybridisation, in Nucleic Acid 
Hybridisation (1985). Other references include more sophisticated computations which 
take structural as well as sequence characteristics into account for the calculation of 
10 T„. 

As used herein the term "hybridization complex" refers to a complex formed 
between two nucleic acid sequences by virtue of the formation of hydrogen bounds 
between complementary G and C bases and between complementary A and T bases; 
these hydrogen bonds may be further stabilized by base stacking interactions. The two 

15 complementary nucleic acid sequences hydrogen bond in an antiparallel configuration. 

A hybridization complex may be formed in solution {e.g., C^t or R^^ analysis) or 
between one nucleic acid sequence present in solution and another nucleic acid 
sequence immobilized to a solid support {e.g., a nylon membrane or a nitrocellulose 
filter as employed in Southern and Northern blotting, dot blotting or a glass slide as 

20 employed in in situ hybridization, including FISH [fluorescent in situ hybridization]). 

As used herein, the term "antisense" is used in reference to RNA sequences 
which are complementary to a specific RNA sequence {e.g., mRNA). Antisense RNA 
may be produced by any method, including synthesis by splicing the gene(s) of interest 
in a reverse orientation to a viral promoter which permits the synthesis of a coding 

25 strand. Once introduced into a cell, this transcribed strand combines with natural 

mRNA produced by the cell to form duplexes. These duplexes then block either the 
further transcription of the mRNA or its translation. In this maimer, mutant 
phenotypes may be generated. The term "antisense strand" is used in reference to a 
nucleic acid strand that is complementary to the "sense" strand. The designation (-) 
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(i.e., "negative") is sometimes used in reference to the antisense strand, with the 
designation (+) sometimes used in reference to the sense (i.e., "positive") strand. 

As used herein the term "portion" when in reference to a protein (as in "a 
portion of a given protein") refers to fragments of that protein. The fragments may 
range in size from four amino acid residues to the entire amino acid sequence minus 
one amino acid. Thus, a protein "comprising at least a portion of the amino acid 
sequence of SEQ ID N0:2" encompasses the full-length 123 kDa telomerase protein 
subunit and fragments thereof. 

The term "antigenic determinant" as used herein refers to that portion of an 
antigen that makes contact with a particular antibody (i.e., an epitope). When a 
protein or fragment of a protein is used to immunize a host animal, numerous regions 
of the protein may induce the production of antibodies which bind specifically to a 
given region or three-dimensional structure on the protein; these regions or structures 
are referred to as antigenic determinants. An antigenic determinant may compete with 
the intact antigen (i.e., the immunogen used to elicit the immune response) for binding 
to an antibody. 

The terms "specific binding" or specifically binding" when used in reference to 
the interaction of an antibody and a protein or peptide means that the interaction is 
dependent upon the presence of a particular structure (i.e., the antigenic determinant or 
epitope) on the protein; in other words the antibody is recognizing and binding to a 
specific protein structure rather than to proteins in general. For example, if an 
antibody is specific for epitope "A", the presence of a protein containing epitope A (or 
free, unlabelled A) in a reaction containing labelled "A" and the antibody will reduce 
the amount of labelled A bound to the antibody. 
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The term "sample" as used herein is used in its broadest sense. A biological 
sample suspected of containing nucleic acid encoding telomerase subunits may 
comprise a cell, chromosomes isolated from a cell (e.g., a spread of metaphase 
chromosomes), genomic DNA (in solution or bound to a solid support such as for 
5 Southern blot analysis), RNA (in solution or bound to a solid support such as for 

Northern blot analysis), cDNA (in solution or bound to a solid support) and the like. 
A sample suspected of containing a protein may comprise a cell, a portion of a tissue, 
an extract containing one or more proteins and the like. 

The term "correlates with expression of a polynucleotide," as used herein, 
10 indicates that the detection of the presence of ribonucleic acid (RNA) complementary 
to a telomerase sequence by hybridization assays is indicative of the presence of 
mRNA encoding eukaryotic telomerases, including human telomerases in a sample, 
and thereby correlates with expression of the telomerase mRNA from the gene 
encoding the protein. 

1 5 "Alterations in the polynucleotide" as used herein comprise any alteration in the 

sequence of polynucleotides encoding telomerases, including deletions, insertions, and 
point mutations that may be detected using hybridization assays. Included within this 
definition is the detection of alterations to the genomic DNA sequence which encodes 
telomerase (e.g., by alterations in pattern of restriction enzyme fragments capable of 

20 hybridizing to any sequence such as SEQ ID NOS: 1 or 3 [e.g., RFLP analysis], the 

inability of a selected fragment of any sequence to hybridize to a sample of genomic 
DNA [e.g., using allele-specific oligonucleotide probes], improper or unexpected 
hybridization, such as hybridization to a locus other than the normal chromosomal 
locus for the telomere or telomerase genes e.g., using FISH to metaphase 

25 chromosomes spreads, etc.]). 

A "variant" in regard to amino acid sequences is used to indicate an amino acid 
sequence that differs by one or more amino acids from another, usually related amino 
acid. The variant may have "conservative" changes, wherein a substituted amino acid 
has similar structural or chemical properties (e.g., replacement of leucine with 
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isoleucine). More rarely, a variant may have "non-conservative" changes, e.g., 
replacement of a glycine with a tryptophan. Similar minor variations may also include 
amino acid deletions or insertions {i.e., additions), or both. Guidance in determining 
which and how many amino acid residues may be substituted, inserted or deleted 
5 without abolishing biological or immunological activity may be found using computer 
programs well known in the art, for example, DNAStar software. Thus, it is 
contemplated that this definition will encompass variants of telomerase and/or 
telomerase protein subunits. For example, the polypeptides encoded by the three open 
reading frames (ORFs) of the 43 kDa polypeptide gene may be considered to be 

10 variants of each other. Such variants can be tested in functional assays, such as 
telomerase assays to detect the presence of functional telomerase in a sample. 

The term "derivative" as used herein refers to the chemical modification of a 
nucleic acid encoding telomerase structures, such as the 123 kDa or 43 kDa protein 
subunits of the E. aediculatus telomerase, or other telomerase proteins or peptides. 

15 Illustrative of such modifications would be replacement of hydrogen by an alkyl, acyl, 

or amino group. A nucleic acid derivative would encode a polypeptide which retains 
essential biological characteristics of naturally-occurring telomerase or its subunits. 

The term "biologically active" refers to telomerase molecules or peptides 
having structural, regulatory, or biochemical functions of a naturally occurring 

20 telomerase molecules or peptides. Likewise, "immunologically active," defines the 
capability of the natural, recombinant, or synthetic telomerase proteins or any 
oligopeptide thereof, to induce a specific immime response in appropriate animals or 
cells, and to bind with specific antibodies. 

"Affinity purification" as used herein refers to the purification of 

25 ribonucleoprotein particles, through the use of an "affinity oligonucleotide" {i.e., an 

antisense oligonucleotides) to bind the particle, followed by the step of eluting the 
particle from the oligonucleotide by means of a "displacement oligonucleotide." In the 
present invention, the displacement oligonucleotide has a greater degree of 
complementarity with the affinity oligonucleotide, and therefore produces a more 

30 thermodynamically stable duplex than the particle and the affinity oligonucleotide. For 
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example, telomerase may be boimd to the affinity oligonucleotide and then eluted by 
use of a displacement oligonucleotide which binds to the affinity oligonucleotide. In 
essence, the displacement oligonucleotide displaces the telomerase from the affinity 
oligonucleotide, allowing the elution of the telomerase. Under sufficiently mild 
conditions, the method results in the enrichment of functional ribonucleoprotein 
particles. Thus, the method is useful for the purification of telomerase from a mixture 
of compounds. 

GENERAL DESCRIPTION OF THE INVENTION 

The present invention provides purified telomerase preparations and telomerase 
protein subunits useful for investigations of the activities of telomerases, including 
potential nuclease activities. In particular, the present invention is directed to the 
telomerase and co-purifying polypeptides obtained fi-om Euplotes aediculatus. This 
organism, a hypotrichous ciliate, was chosen for use in this invention as it contains an 
unusually large number of chromosomal ends (Prescott, Microbiol. Rev., 58:233 
[1994]), because a very large number of gene-sized DNA molecules are present in its 
polyploid macronucleus. Tetrahymena, a holotrichous ciliate commonly used in 
previous studies of telomerase and telomeres, is as evolutionarily distant from Euplotes 
as plants are from mammals (Greenwood et al, J. Mol. Evol., 3:163 [1991]). 

The homology found between the 123 kDa E. aediculatus telomerase subunit 
and the L8543.12 sequence {i.e., Est2 of Saccharomyces cerevisiae; See, Lendvay et 
al. Genetics 144:1399-1412 [1996]), Schizosaccharomyces, and human motifs, 
provides a strong basis for predicting that full human telomerase molecule comprises a 
protein that is large, basic, and includes such reverse transcriptase motifs. Thus, the 
compositions and methods of the present invention is usefiil for the identification of 
other telomerases, from a wide variety of species. The present invention describes the 
use of the 123 kDa reverse transcriptase motifs in a method to identify similar motifs 
in organisms that are distantly related to Euplotes (e.g., Oxytricha), as well as 
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organisms that are not related to Euplotes {e.g., Saccharomyces, Schizosaccharomyces, 
humans, etc.). 

The present invention also provides additional methods for the study of the 
structure and function of distinct forms of telomerase. It is contemplated that the 
telomerase proteins of the present invention will be useful in diagnostic applications, 
evolutionary {e.g., phylogenetic) investigations, as well as development of 
compositions and methods for cancer therapy or anti-aging regimens. Although the 
telomerase protein subunits of the present invention themselves have utility, it further 
contemplated that the polypeptides of the present invention will be useful in 
conjunction with the RNA moiety of the telomerase enzyme {i.e., a complete 
telomerase). 

It is also contemplated that methods and compositions of this invention will 
lead to the discovery of additional unique telomerase structures and/or functions. In 
addition, the present invention provides novel methods for purification of functional 
telomerase, as well as telomerase proteins. This affinity based method described in 
Example 3, is an important aspect in the purification of functionally active telomerase. 
A key advantage of this procedure is the ability to use mild elution conditions, during 
which proteins that bind non-specifically to the column matrix are not eluted. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to the nucleic and amino acid sequences of 
the protein subunits of the E. aediculatus telomerase, as well as the nucleic and amino 
acid sequences of the telomerases from other organisms, including humans. In 
addition, the present invention is directed to the purification of ftmctional telomerase. 
As described below the present invention also comprises various forms of telomerase, 
including recombinant telomerase and telomerase protein subunits, obtained from 
various organisms. 
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The 123 kDa and 43 kDa Telomerase Subunit Protein Sequences 

The nucleic acid and deduced amino acid sequences of the 123 and 43 kDa 
protein subunits are shown in Figures 1-6. In accordance with the invention, any 
nucleic acid sequence which encodes E. aediculatus telomerase or its subunits can be 
used to generate recombinant molecules which express the telomerase or its subunits. 

It will be appreciated by those skilled in the art that as a result of the 
degeneracy of the genetic code, a multitude of telomerase subunit protein sequences, 
some bearing minimal homology to the nucleotide sequences of any known and 
naturally occurring gene, may be produced. The invention contemplates each and 
every possible variation of nucleotide sequence that could be made by selecting 
combinations based on possible codon choices, taking into account the use of the 
codon "UGA" as encoding cysteine in E. aediculatus. Other than the exception of the 
"UGA" codon, these combinations are made in accordance with the standard triplet 
genetic code as applied to the nucleotide sequence encoding naturally occurring E. 
aediculatus telomerase, and all such variations are to be considered as being 
specifically disclosed. For example, the amino acid sequences encoded by each of the 
three open reading frames of the 43 kDa nucleotide sequence are specifically included 
(SEQ ID NOS:4-6). It is contemplated that any variant forms of telomerase subunit 
protein be encompassed by the present invention, as long as the proteins are functional 
in assays such as those described in the Examples. 

Although nucleotide sequences which encode E. aediculatus telomerase protein 
subunits and their variants are preferably capable of hybridizing to the nucleotide 
sequence of the naturally occurring sequence under appropriately selected conditions of 
stringency, it may be advantageous to produce nucleotide sequences encoding E. 
aediculatus telomerase protein subunits or their derivatives possessing a substantially 
different codon usage, including the "standard" codon usage employed by human and 
other systems. Codons may be selected to increase the rate at which expression of the 
peptide occurs in a particular prokaryotic or eukaryotic expression host in accordance 
with the frequency with which particular codons are utilized by the host. Other 
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reasons for substantially altering the nucleotide sequence encoding telomerase subunits 
and their derivatives without altering the encoded amino acid sequences include the 
production of RNA transcripts having more desirable properties, such as a greater or a 
shorter half-life, than transcripts produced from the naturally occurring sequence. 

It is now possible to produce a DNA sequence, or portions thereof, encoding 
telomerase protein subunits and their derivatives entirely by synthetic chemistry, after 
which the synthetic gene may be inserted into any of the many available DNA vectors 
and cell systems using reagents that are well known in the art. Moreover, synthetic 
chemistry may be used to introduce mutations into a sequence encoding E. aediculatus 
protein subunits or any portion thereof, as well as sequences encoding yeast or human 
telomerase proteins, subunits, or any portion thereof 

Also included within the scope of the present invention are polynucleotide 
sequences that are capable of hybridizing to the nucleotide sequence of Figures 9, 11, 
12, and 26, under various conditions of stringency. Hybridization conditions are based 
on the melting temperature (TJ of the nucleic acid binding complex or probe, as 
taught in Berger and Kimmel (Berger and Kimmel, Guide to Molecular Cloning 
Techniques, Meth. Enzymol., vol. 152, Academic Press, San Diego CA [1987]) 
incorporated herein by reference, and may be used at a defined "stringency". 

Altered nucleic acid sequences encoding telomerase protein subunits which may 
be used in accordance with the invention include deletions, insertions or substitutions 
of different nucleotides resulting in a polynucleotide that encodes the same or a 
functionally equivalent telomerase subunit. The protein may also show deletions, 
insertions or substitutions of amino acid residues which produce a silent change and 
resuh in a functionally equivalent telomerase subunit. Deliberate amino acid 
substitutions may be made on the basis of similarity in polarity, charge, solubility, 
hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues as long as 
the biological activity of the telomerase subunit is retained. For example, negatively 
charged amino acids include aspartic acid and glutamic acid; positively charged amino 
acids include lysine and arginine; and amino acids with uncharged polar head groups 
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having similar hydrophilicity values include leucine, isoleucine, valine; glycine, 
alanine; asparagine, glutamine; serine, threonine; and phenylalanine, tyrosine. 

Methods for DNA sequencing are well known in the art and employ such 
enzymes as the Klenow fragment of DNA polymerase I, Sequenase® (US Biochemical 
Corp, Cleveland OH), Taq DNA polymerase (Perkin Elmer, Norwalk CT), 
thermostable T7 polymerase (Amersham, Chicago IL), or combinations of recombinant 
polymerases and proofreading exonucleases such as the ELONGASE Amplification 
System marketed by Gibco BRL (Gaithersburg MD). Preferably, the process is 
automated with machines such as the Hamilton Micro Lab 2200 (Hamilton, Reno NV), 
Peltier Thermal Cycler (PTC200; MJ Research, Watertown MA) and the ABI 377 
DNA sequencers (Perkin Elmer). 

Also included within the scope of the present invention are alleles encoding 
human telomerase proteins and subunits. As used herein, the term "allele" or "allelic 
sequence" is an alternative form of the nucleic acid sequence encoding human 
telomerase proteins or subunits. Alleles resuk from mutations {i.e„ changes in the 
nucleic acid sequence), and generally produce altered mRNAs or polypeptides whose 
structure and/or function may or may not be altered. An given gene may have no, one 
or many allelic forms. Common mutational changes that give rise to alleles are 
generally ascribed to natural deletions, additions, or substitutions of amino acids. Each 
of these types of changes may occur alone, or in combination with the others, one or 
more times within a given sequence. 

Human Telomerase Motifs 

The present invention also provides nucleic and amino acid sequence 
information for human telomerase motifs. These sequences were first identified in a 
BLAST search conducted using the Euplotes 123 kDa peptide, and a homologous 
sequence from Schizosaccharomyces, designated as "tezl." Figure 25 shows the 
sequence alignment of the Euplotes ("pl23"), Schizosaccharomyces ("tezl"), Est2p 
{i.e., the S. cerevisiae protein encoded by the Est2 nucleic acid sequence, and also 
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referred to herein as "L8543.12"), and the human homolog identified in this 
comparison search. The amino acid sequence of this aligned portion is provided in 
SEQ ID N0:61 (the cDNA sequence is provided in SEQ ID NO:62), while the portion 
of tezl shown in Figure 25 is provided in SEQ ID NO:63. The portion of Est2 shown 
5 in this Figure is also provided in SEQ ID NO:64, while the portion of pl23 shown is 

also provided in SEQ ID NO:65. 

As shown in Figure 25, there are regions that are highly conserved among these 
proteins. For example, as shown in this Figure, there are regions of identity in "Motif 
0," "Motif 1, "Motif 2," and "Motif 3." The identical amino acids are indicated with 
10 an asterisk (*), while the similar amino acid residues are indicated by a circle (•). 

This indicates that there are regions within the telomerase motifs that are conserved 
among a wide variety of eukaryotes, ranging from yeast to ciliates, to humans. It is 
contemplated that additional organisms will likewise contain such conserved regions of 
sequence. 

15 Figure 27 shows the amino acid sequence of the cDNA clone encoding human 

telomerase motifs (SEQ ID NO:67), while Figure 28 shows the DNA sequence of the 
clone. Figure 29 shows the amino acid sequence of tezl (SEQ ID NO:68), while 
Figure 30 shows the DNA sequence of tezl (SEQ ID NO:69). In Figure 30, the 
introns and other non-coding regions are shown in lower case, while the axons (i.e., 

20 coding regions are shown in upper case 

Extending The Polynucleotide Sequence 

The polynucleotide sequence encoding telomerase, or telomerase protein 
subunits, or their functional equivalents, may be extended utilizing partial nucleotide 
sequence and various methods known in the art to detect upstream sequences such as 
25 promoters and regulatory elements. For example, Gobinda et al. (Gobinda et al, PGR 

Meth. Apphc. 2:318-22 [1993]) describe "restriction-site" polymerase chain reaction 
(PGR) as a direct method which uses universal primers to retrieve unknown sequence 
adjacent to a known locus. First, genomic DNA is amplified in the presence of primer 
to a linker sequence and a primer specific to the known region. The amplified 
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sequences are subjected to a second round of PGR with the same hnker primer and 
another specific primer internal to the first one. Products of each round of PGR are 
transcribed with an appropriate RNA polymerase and sequenced using reverse 
transcriptase. 

5 Inverse PGR can be used to amplify or extend sequences using divergent 

primers based on a known region (Triglia et al, Nucleic Acids Res 16:8186 [1988]). 
The primers may be designed using OLIGO® 4.06 Primer Analysis Software (National 
Biosciences Inc, Plymouth MN [1992]), or another appropriate program, to be 22-30 
nucleotides in length, to have a GC content of 50% or more, and to anneal to the 
10 target sequence at temperatures about 68°-72°G. The method uses several restriction 

enzymes to generate a suitable fragment in the known region of a gene. The fragment 
is then circularized by intramolecular ligation and used as a PGR template. 

Gapture PGR (Lagerstrom et al. PGR Methods Applic 1:111-19 [1991]), a 
method for PGR amplification of DNA fragments adjacent to a known sequence in 
15 human and yeast artificial chromosome DNA, may also be used. Gapture PGR also 

requires multiple restriction enzyme digestions and ligations to place an engineered 
double-stranded sequence into an unknown portion of the DNA molecule before PGR. 

Another method which may be used to retrieve unknown sequence is walking 
PGR (Parker et al. Nucleic Acids Res 19:3055-60 [1991]), a method for targeted gene 
20 walking. Alternatively, PGR, nested primers, PromoterFinder™ (Glontech, Palo Alto 

GA) and PromoterFinder libraries can be used to walk in genomic DNA. This process 
avoids the need to screen libraries and is useful in finding intron/exon junctions. 

Preferred libraries for screening for full length cDNAs are ones that have been 
size-selected to include larger cDNAs. Also, random primed libraries are preferred in 
25 that they will contain more sequences which contain the 5' and upstream regions of 

genes. A randomly primed library may be particularly useful if an oligo d(T) library 
does not yield a full-length cDNA. Genomic libraries are useful for extension into the 
5' nontranslated regulatory region. 

Gapillary electrophoresis may be used to analyze either the size or confirm the 
30 nucleotide sequence in sequencing or PGR products. Systems for rapid sequencing are 



available from Perkin Elmer, Beckman Instruments (FuUerton CA), and other 
companies. Capillary sequencing may employ flowable polymers for electrophoretic 
separation, four different fluorescent dyes (one for each nucleotide) which are laser 
activated, and detection of the emitted wavelengths by a charge coupled devise camera. 
Output/light intensity is converted to electrical signal using appropriate software (e.g., 
Genotyper™ and Sequence Navigator™ from Perkin Elmer) and the enthe process 
from loading of samples to computer analysis and electronic data display is computer 
controlled. Capillary electrophoresis is particularly suited to the sequencing of small 
pieces of DNA which might be present in limited amounts in a particular sample. The 
reproducible sequencing of up to 350 bp of M13 phage DNA in 30 min has been 
reported (Ruiz-Martinez et al, Anal Chem 65:2851-8 [1993]). 

Expression of the Nucleotide Sequence 

In accordance with the present invention, polynucleotide sequences which 
encode telomerase, telomerase protein subunits, or their functional equivalents, may be 
used in recombinant DNA molecules that direct the expression of telomerase or 
telomerase subunits by appropriate host cells. 

The nucleotide sequences of the present invention can be engineered in order to 
alter either or both telomerase subunits for a variety of reasons, including but not 
limited to, alterations which modify the cloning, processing and/or expression of the 
gene product. For example, mutations may be introduced using techniques which are 
well known in the art {e.g., site-directed mutagenesis to insert new restriction sites, to 
alter glycosylation patterns, to change codon preference, to produce splice variants, 
etc.). 

In an alternate embodiment of the invention, the sequence encoding the 
telomerase subunit(s) may be synthesized, whole or in part, using chemical methods 
well known in the art {See e.g, Caruthers et al. Nucleic Acids Res. Symp. Ser., 215- 
223 [1980]; and Horn et al. Nucleic Acids Res. Symp. Ser., 225-232 [1980]). 
Alternatively, the protein itself could be produced using chemical methods to 
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synthesize a telomerase subimit amino acid sequence, in whole or in part. For 
example, peptide synthesis can be performed using various solid-phase techniques 
(Roberge, et al. Science 269:202 [1995]) and automated synthesis may be achieved, 
for example, using the ABI 431 A Peptide Synthesizer (Perkin Elmer) in accordance 
with the instructions provided by the manufacturer. 

The newly synthesized peptide can be substantially purified by preparative high 
performance liquid chromatography {e.g., Creighton, Proteins, Structures and 
Molecular Principles, WH Freeman and Co, New York NY [1983]). The composition 
of the synthetic peptides may be confirmed by amino acid analysis or sequencing {e.g., 
the Edman degradation procedure; Creighton, supra). Additionally the amino acid 
sequences of telomerase subunit proteins, or any part thereof, may be altered during 
direct synthesis and/or combined using chemical methods with sequences from other 
proteins, or any part thereof, to produce a variant polypeptide. 

Expression Systems 

In order to express a biologically active telomerase protein subunit, the 
nucleotide sequence encoding the subunit or the functional equivalent, is inserted into 
an appropriate expression vector {i.e., a vector which contains the necessary elements 
for the transcription and translation of the inserted coding sequence). In order to 
express a biologically active telomerase enzyme, the nucleotide sequence encoding the 
telomerase protein subunits are inserted into appropriate expression vectors and the 
nucleotide sequence encoding the telomerase RNA subunit is inserted into the same or 
another vector for RNA expression. The protein and RNA subunits are then either 
expressed in the same cell or expressed separately, and then mixed to achieve a 
reconstituted telomerase. 

Methods which are well known to those skilled in the art can be used to 
construct expression vectors containing a telomerase protein subunit sequence and 
appropriate transcriptional or translational controls. These methods include in vitro 
recombinant DNA techniques, synthetic techniques and in vivo recombination or 
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genetic recombination. Such techniques are described in Sambrook et al. (Sambrook 
et al, Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview 
NY [1989]), and Ausubel et al. (Ausubel et al. Current Protocols in Molecular 
Biology, John Wiley & Sons, New York NY [1989]). These same methods may be 
used to convert the UGA codons, which encode cysteine in Euplotes, to the UGU or 
UGC codon for cysteine recognized by the host expression system. 

A variety of expression vector/host systems may be utilized to contain and 
express a telomerase subunit-encoding sequence. These include but are not limited to 
microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid 
or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; 
insect cell systems infected with virus expression vectors (e.g., baculovirus); plant cell 
systems transfected with virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or transformed with bacterial expression vectors 
(e.g., Ti or pBR322 plasmid); or animal cell systems. 

The "control elements" or "regulatory sequences" of these systems vary in their 
strength and specificities and are those non-translated regions of the vector, enhancers, 
promoters, and 3' untranslated regions, which interact with host cellular proteins to 
carry out transcription and translation. Depending on the vector system and host 
utilized, any number of suitable transcription and translation elements, including 
constitutive and inducible promoters, may be used. For example, when cloning in 
bacterial systems, inducible promoters such as the hybrid lacZ promoter of the 
Bluescript® phagemid (Stratagene, La JoUa CA) or pSportl (Gibco BRL) and ptrp-lac 
hybrids and the like may be used. The baculovirus polyhedrin promoter may be used 
in insect cells. Promoters or enhancers derived from the genomes of plant cells (e.g., 
heat shock, RUBISCO; and storage protein genes) or from plant viruses (e.g., viral 
promoters or leader sequences) may be cloned into the vector. In mammalian cell 
systems, promoters from the mammalian genes or from mammalian viruses are most 
appropriate. If it is necessary to generate a cell luie that contains multiple copies of 
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the sequence encoding telomerase or telomerase protein subunits, vectors based on 
SV40 or EBV may be used with an appropriate selectable marker. 

In bacterial systems, a number of expression vectors may be selected depending 
upon the use intended for the telomerase protein or subunit. For example, when large 
quantities of telomerase protein, subunit, or peptides, are needed for the induction of 
antibodies, vectors which direct high level expression of fusion proteins that are 
readily purified may be desirable. Such vectors include, but are not limited to, the 
multifunctional E. coli cloning and expression vectors such as Bluescript® 
(Stratagene), in which the sequence encoding the telomerase or protein subunit may be 
ligated into the vector in frame with sequences for the amino-terminal Met and the 
subsequent 7 residues of p-galactosidase so that a hybrid protein is produced {e.g., pIN 
vectors; Van Heeke and Schuster, J. Biol. Chem., 264:5503-5509 [1989]) and the like. 
pGEX vectors (Promega, Madison WI) may also be used to express foreign 
polypeptides as fusion proteins with glutathione S-transferase (GST). In general, such 
fusion proteins are soluble and can easily be purified from lysed cells by adsorption to 
glutathione-agarose beads followed by elution in the presence of free glutathione. 
Proteins made in such systems are designed to include heparin, thrombin or factor Xa 
protease cleavage sites so that the cloned polypeptide of interest can be released from 
the GST moiety at will. 

In the yeast, Saccharomyces cerevisiae, a number of vectors containing 
constitutive or inducible promoters such as alpha factor, alcohol oxidase and PGH may 
be used. For reviews, see Ausubel et al. {supra) and Grant et al, Meth. Enzymol., 
153:516-544 (1987). 

In cases where plant expression vectors are used, the expression of a sequence 
encoding telomerase or protein subunit, may be driven by any of a number of 
promoters. For example, viral promoters such as the 35S and 19S promoters of CaMV 
(Brisson et al. Nature 310:511-514 [1984]) may be used alone or in combination with 
the omega leader sequence from TMV (Takamatsu et al, EMBO J., 6:307-311 
[1987]). Alternatively, plant promoters such as the small subunit of RUBISCO 
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(Coruzzi et al. EMBO J., 3:1671-1680 [1984]; Broglie et al, Science 224:838-843 
[1984]) or heat shock promoters (Winter and Sinibaldi Results Probl. Cell Differ., 
17:85-105 [1991]) may be used. These constructs can be introduced into plant cells by 
direct DNA transformation or pathogen-mediated transfection (for reviews of such 
techniques, see Hobbs or Murry, in McGraw Hill Yearbook of Science and Technology 
McGraw Hill New York NY, pp. 191-196 [1992]; or Weissbach and Weissbach, 
Methods for Plant Molecular Biology, Academic Press, New York NY, pp. 421-463 
[1988]). 

An alternative expression system which could be used to express telomerase or 
telomerase protein subunit is an insect system. In one such system, Autographa 
californica nuclear polyhedrosis virus (AcNPV) is used as a vector to express foreign 
genes in Spodoptera frugiperda cells or in Trichoplusia larvae. The sequence 
encoding the telomerase sequence of interest may be cloned into a nonessential region 
of the virus, such as the polyhedrin gene, and placed under control of the polyhedrin 
promoter. Successful insertion of the sequence encoding the telomerase protein or 
telomerase protein subunit will render the polyhedrin gene inactive and produce 
recombinant virus lacking coat protein. The recombinant viruses are then used to 
infect S. frugiperda cells or Trichoplusia larvae in which the telomerase sequence is 
expressed (Smith et al, J. Virol., 46:584 [1983]; Engelhard et al, Proc. Natl. Acad. 
Sci. 91:3224-7 [1994]). 

In mammalian host cells, a number of viral-based expression systems may be 
utilized. In cases where an adenovirus is used as an expression vector, a sequence 
encoding telomerase protein or telomerase protein subunit, may be ligated into an 
adenovirus transcription/ translation complex consisting of the late promoter and 
tripartite leader sequence. Insertion in a nonessential El or E3 region of the viral 
genome will result in a viable virus capable of expressing in infected host cells (Logan 
and Shenk, Proc. Natl. Acad. Sci., 81:3655-59 [1984]). In addition, transcription 
enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used to increase 
expression in mammalian host cells. 
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Specific initiation signals may also be required for efficient translation of a 
sequence encoding telomerase protein subunits. These signals include the ATG 
initiation codon and adjacent sequences. In cases where the sequence encoding a 
telomerase protein subunit, its initiation codon and upstream sequences are inserted 
into the most appropriate expression vector, no additional translational control signals 
may be needed. However, in cases where only coding sequence, or a portion thereof, 
is inserted, exogenous transcriptional control signals including the ATG initiation 
codon must be provided. Furthermore, the initiation codon must be in the correct 
reading frame to ensure transcription of the entire insert. Exogenous transcriptional 
elements and initiation codons can be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers 
appropriate to the cell system in use (Scharf et al. Results Probl. Cell Differ., 20:125 
[1994]; and Bittner et al, Meth. Enzymol., 153:516 [1987). 

In addition, a host cell strain may be chosen for its ability to modulate the 
expression of the inserted sequences or to process the expressed protein in the desired 
fashion. Such modifications of the polypeptide include, but are not limited to, 
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. 
Post-translational processing which cleaves a "prepro" form of the protein may also be 
important for correct insertion, folding and/or function. Different host cells such as 
CHO (ATCC CCL 61 and CRL 9618), HeLa (ATCC CCL 2), MDCK (ATCC CCL 34 
and CRL 6253), HEK 293 (ATCC CRL 1573), WI-38 (ATCC CCL 75) (ATCC: 
American Type Culture Collection, Rockville, MD), etc have specific cellular 
machinery and characteristic mechanisms for such post-translational activities and may 
be chosen to ensure the correct modification and processing of the introduced, foreign 
protein. 

For long-term, high-yield production of recombinant proteins, stable expression 
is preferred. For example, cell lines which stably express telomerase or a telomerase 
subunit protein may be transformed using expression vectors which contain viral 
origins of replication or endogenous expression elements and a selectable marker gene. 
Following the introduction of the vector, cells may be allowed to grow for 1-2 days in 
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an enriched media before they are switched to selective media. The purpose of the 
selectable marker is to confer resistance to selection, and its presence allows growth 
and recovery of cells which successfully express the introduced sequences. Resistant 
clumps of stably transformed cells can be proliferated using tissue culture techniques 
5 appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. 
These include, but are not limited to, the herpes simplex virus thymidine kinase 
(Wigler et al., Cell 11:223-32 [1977]) and adenine phosphoribosyltransferase (Lowy et 
al, Cell 22:817 [1980]) genes which can be employed in tk- or aprt- cells, 

10 respectively. Also, antimetabolite, antibiotic or herbicide resistance can be used as the 

basis for selection; for example, dhjr which confers resistance to methotrexate (Wigler 
et al, Proc. Natl. Acad. Sci., 77:3567 [1980]); npt, which confers resistance to the 
aminoglycosides neomycin and G-418 (Colbere-Garapin et al., J. Mol. Biol, 150:1 
[1981]) and als or pat, which confer resistance to chlorsulfuron and phosphinotricin 

15 acetyltransferase, respectively (Murry, In McGraw Hill Yearbook of Science and 

Technology, McGraw Hill, New York NY, pp 191-196, [1992]). Additional selectable 
genes have been described, for example, trpB, which allows cells to utilize indole in 
place of tryptophan, or hisD, which allows cells to utilize histinol in place of histidine 
(Hartman and Mulligan, Proc. Natl. Acad. Sci., 85:8047 [1988]). Recently, the use of 

20 visible markers has gained popularity with such markers as anthocyanins, p 

glucuronidase and its substrate, GUS, and luciferase and its substrate, luciferin, being 
widely used not only to identify transformants, but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector system (Rhodes et 
al, Meth. Mol. Biol., 55:121 [1995]). 



25 Identification of Transformants Containing the Polynucleotide Sequence 

Although the presence/absence of marker gene expression suggests that the 
gene of interest is also present, its presence and expression should be confirmed. For 
example, if the sequence encoding a telomerase protein subunit is inserted within a 
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marker gene sequence, recombinant cells containing the sequence encoding the 
telomerase protein subunit can be identified by the absence of marker gene function. 
Alternatively, a marker gene can be placed in tandem with the sequence encoding 
telomerase protein subunit xinder the control of a single promoter. Expression of the 
marker gene in response to induction or selection usually indicates expression of the 
tandem sequence as well. 

Alternatively, host cells which contain the coding sequence for telomerase or a 
telomerase protein subunit and express the telomerase or protein subunit be identified 
by a variety of procedures known to those of skill in the art. These procedures 
include, but are not limited to, DNA-DNA or DNA-RNA hybridization and protein 
bioassay or immunoassay techniques which include membrane, solution, or chip-based 
technologies for the detection and/or quantification of the nucleic acid or protein. 

The presence of the polynucleotide sequence encoding telomerase protein 
subunits can be detected by DNA-DNA or DNA-RNA hybridization or amplification 
using probes, portions, or fragments of the sequence encoding the subunit. Nucleic 
acid amplification based assays involve the use of oligonucleotides or oligomers based 
on the nucleic acid sequence to detect transformants containing DNA or RNA 
encoding the telomerase subunit. As used herein "oligonucleotides" or "oligomers" 
refer to a nucleic acid sequence of approximately 10 nucleotides or greater and as 
many as approximately 100 nucleotides, preferably between 15 to 30 nucleotides, and 
more preferably between 20-25 nucleotides which can be used as a probe or amplimer. 

A variety of protocols for detecting and measuring the expression of proteins 
(e.g., telomerase or a telomerase protein subunits) using either polyclonal or 
monoclonal antibodies specific for the protein are known in the art. Examples include 
enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA) and 
fluorescent activated cell sorting (FACS). These and other assays are described, 
among other places, in Hampton et al. Serological Methods a Laboratory Manual, 
APS Press, St Paul MN [1990]) and Maddox et al, J. Exp. Med., 158:1211 [1983]). 

A wide variety of labels and conjugation techniques are known by those skilled 
in the art and can be used in various nucleic acid and amino acid assays. Means for 
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producing labeled hybridization or PGR probes for detecting related sequences include 
oligolabeling, nick translation, end-labeling or PGR amplification using a labeled 
nucleotide. Alternatively, a telomerase protein subunit sequence, or any portion of it, 
may be cloned into a vector for the production of an mRNA probe. Such vectors are 
known in the art, are commercially available, and may be used to synthesize RNA 
probes in vitro by addition of an appropriate RNA polymerase such as T7, T3 or SP6 
and labeled nucleotides. 

A number of companies such as Pharmacia Biotech (Piscatav^^ay NJ), Promega 
(Madison WI), and US Biochemical Corp (Gleveland OH) supply commercial kits and 
protocols for these procedures. Suitable reporter molecules or labels include those 
radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents as well 
as substrates, cofactors, inhibitors, magnetic particles and the like. Patents teaching the 
use of such labels include US Patents 3,817,837; 3,850,752; 3,939,350; 3,996,345; 
4,277,437; 4,275,149 and 4,366,241, herein incorporated by reference. Also, 
recombinant immunoglobulins may be produced as shown in US Patent No. 4,816,567 
incorporated herein by reference. 



Purification of Recombinant Telomerase and Telomerase Subunit Proteins 

In addition to the method of purification described in Example 3 below, it is 
contemplated that additional methods of purifying recombinantly produced telomerase 
or telomerase protein subunits will be used. For example, host cells transformed with 
a nucleotide sequence encoding telomerase or telomerase subunit protein(s) may be 
cultured under conditions suitable for the expression and recovery of the encoded 
protein from cell culture. The protein produced by a recombinant cell may be secreted 
or contained intracellularly depending on the sequence and/or the vector used. As will 
be understood by those of skill in the art, expression vectors containing the telomerase 
or subunit protein encoding sequence can be designed with signal sequences which 
direct secretion of the telomerase or telomerase subunit protein through a prokaryotic 
or eukaryotic cell membrane. Other recombinant constructions may join the sequence 
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encoding the telomerase or subunit protein to a nucleotide sequence encoding a 
polypeptide domain. 

Telomerase or telomerase subunit protein(s) may also be expressed as 
recombinant proteins with one or more additional polypeptide domains added to 
5 facilitate protein pvirification. Such purification facilitating domains include, but are 

not limited to, metal chelating peptides such as polyhistidine tracts and 
histidine-tryptophan modules that allow purification on immobilized metals, protein A 
domains that allow purification on immobilized immunoglobulin, and the domain 
utilized in the FLAGS extension/affinity purification system (Immunex Corp, Seattle 

10 WA). The inclusion of a cleavable linker sequences such as Factor Xa or enterokinase 

(Invitrogen, San Diego CA) between the purification domain and telomerase or 
telomerase protein subunits is useful to facilitate purification. One such expression 
vector provides for expression of a fusion protein comprising the sequence encoding 
telomerase or telomerase protein subunits and nucleic acid sequence encoding 6 

15 histidine residues followed by thioredoxin and an enterokinase cleavage site. The 

histidine residues facilitate purification while the enterokinase cleavage site provides a 
means for purifying the telomerase or telomerase protein subunit from the fusion 
protein. Literature pertaining to vectors containing fusion proteins is available in the 
art {See e.g., Kroll et al, DNA Cell. Biol., 12:441-53 [1993]). 

20 In addition to recombinant production, fragments of telomerase subunit protein 

may be produced by direct peptide synthesis using solid-phase techniques (See e.g., 
Merrifield, J. Am. Chem. Soc, 85:2149 [1963]). In vitro protein synthesis may be 
performed using manual techniques or by automation. Automated synthesis may be 
achieved, for example, using Applied Biosystems 431 A Peptide Synthesizer (Perkin 

25 Elmer, Foster City CA) in accordance with the instructions provided by the 

manufacturer. Various fragments of telomere protein subunit may be chemically 
synthesized separately and combined using chemical methods to produce the fiiU 
length molecule. 
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Uses of Telomerase and Telomerase Subunit Proteins 

The rationale for use of the nucleotide and peptide sequences disclosed herein 
is based in part on the homology between the E. aediculatus telomerase 123 kDa 
protein subunit, the yeast protein L8543.12 (Est2), Schizosaccharomyces, and the 
5 human motifs observed during the development of the present invention. In particular, 

the yeast and 123 kDa protein contain the reverse transcriptase motif in their C- 
terminal regions, they share similarity in regions outside the reverse transcriptase 
motif, they are similarly basic (with a pi of 10.1 for the 123 kDa protein, and of 10.0 
for the yeast), and they are both large (123 kDa and 103 kDa). Furthermore, in view 

10 of the reverse transcriptase motifs, these subunits are believed to comprise the catalytic 

core of their respective telomerases. Indeed, the reverse transcriptase motifs of the 123 
kDa E. aediculatus telomerase protein subunit is shown in the present invention to be 
useful for the identification of similar sequences in other organisms. 

As E. aediculatus and S. cerevisiae are so phylogenetically distant, it is 

15 contemplated that this homology provides a strong basis for predicting that human and 

other telomerases will contain a protein that is large, basic, and includes such reverse 
transcriptase motifs. Indeed, motifs have been identified within a clone encoding the 
human homolog of the telomerase protein. It is further contemplated that this protein 
is essential for human telomerase catalytic activity. This observation should prove 

20 valuable for amplification of the human telomerase gene by PGR or other methods, for 

screening for telomerase sequences in human and other animals, as well as for 
prioritizing candidate telomerase proteins or genes identified by genetic, biochemical, 
or nucleic acid hybridization methods. It is also contemplated that the telomerase 
proteins of the present invention will find use in tailing DNA 3' ends in vitro. 

25 It is contemplated that expression of telomerase and/or telomerase subunit 

proteins in cell lines will find use in the development of diagnostics for tumors and 
aging factors. The nucleotide sequence may be used in hybridization or PGR 
technologies to diagnose the induced expression of messenger RNA sequences early in 
the disease process. Likewise the protein can be used to produce antibodies useful in 
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ELISA assays or a derivative diagnostic format. Such diagnostic tests may allow 
different classes of human tumors or other cell-proliferative diseases to be 
distinguished and thereby facilitate the selection of appropriate treatment regimens. 
It is contemplated that the finding of the reverse transcriptase motifs in the 
5 telomerase proteins of the present invention will be used to develop methods to test 

known and yet to be described reverse transcriptase inhibitors, including nucleosides, 
and non-nucleosides for anti-telomerase activity. 

It is contemplated that the amino acid sequence motifs disclosed herein will 
lead to the development of drugs (e.g., telomerase inhibitors) useful in humans and/or 

10 other animals, that will arrest cell division in cancers or other disorders characterized 

by proliferation of cells. It is also contemplated that the telomerase proteins will find 
use in methods for targeting and directing RNA or RNA-tethered drugs to specific sub- 
cellular compartments such as the nucleus or sub-nuclear organelles, or to telomeres. 
In one embodiment of the diagnostic method of the present invention, normal 

15 or standard values for telomerase mRNA expression are established as a baseline. This 

can be accomplished by a number of assays such as quantitating the amount of 
telomerase mRNA in tissues taken from normal subjects, either animal or human, with 
nucleic probes derived fi-om the telomerase or telomerase protein subunit sequences 
provided herein (either DNA or RNA forms) using techniques which are well known 

20 in the art (e.g.. Southern blots. Northern blots, dot or slot blots). The standard values 

obtained from normal samples may be compared with values obtained from samples 
fi-om subjects potentially affected by disease (e.g., tumors or disorders related to 
aging). Deviation between standard and subject values can establish the presence of a 
disease state. In addition, the deviation can indicate, within a disease state, a particular 

25 clinical outcome (e.g., metastatic or non-metastatic). 

The nucleotide sequence encoding telomerase or telomerase protein subvmits is 
useful when placed in an expression vector for making quantities of protein for 
therapeutic use. The antisense nucleotide sequence of the telomerase gene is 
potentially useful in vectors designed for gene therapy directed at neoplasia including 

30 metastases. Additionally, the inhibition of telomerase expression may be useful in 




detecting the development of disturbances in the aging process or problems occurring 
during chemotherapy . Alternatively, the telomerase or telomerase protein subvmit 
encoding nucleotide sequences may used to direct the expression of telomerase or 
subvinits in situations where it is desirable to increase the amount of telomerase 
5 activity. 

Telomere Subunit Protein Antibodies 

It is contemplated that antibodies directed against the telomerase subunit 
proteins will find use in the diagnosis and treatment of conditions and diseases 
associated with expression of telomerase (including the over-expression and the 

10 absence of expression). Such antibodies include, but are not limited to, polyclonal, 

monoclonal, chimeric, single chain, Fab fragments and fragments produced by a Fab 
expression library. Given the phylogenetic conservation of the reverse transcriptase 
motif in the 123 kDa subunit of the Euplotes telomerase, it is contemplated that 
antibodies directed against this subunit may be useful for the identification of 

15 homologous subunits in other organisms, including humans. It is further contemplated 

that antibodies directed against the motifs provided in the present invention will find 
use in treatment and/or diagnostic areas. 

Telomerase subunit proteins used for antibody induction need not retain 
biological activity; however, the protein fragment, or oligopeptide must be 

20 immunogenic, and preferably antigenic. Peptides used to induce specific antibodies 

may have an amino acid sequence consisting of at least five amino acids, preferably at 
least 10 amino acids. Preferably, they should mimic a portion of the amino acid 
sequence of the natural protein and may contain the entire amino acid sequence of a 
small, naturally occurring molecule. Short stretches of telomerase subimit protein 

25 amino acids may be fused with those of another protein such as keyhole limpet 

hemocyanin and antibody produced against the chimeric molecule. Complete 
telomerase used for antibody induction can be produced by co-expression of protein 
and RNA components in cells, or by reconstitution in vitro from components 
separately expressed or synthesized. 




For the production of antibodies, various hosts including goats, rabbits, rats, 
mice, etc may be immunized by injection with telomerase protein, protein subunit, or 
any portion, fragment or ohgopeptide which retains immunogenic properties. 
Depending on the host species, various adjuvants may be used to increase 
5 immunological response. Such adjuvants are commercially available, and include but 

are not limited to Freund's, mineral gels such as aluminum hydroxide, and surface 
active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, keyhole limpet hemocyanin, and dinitrophenol. BCG (Bacillus 
Calmette-Guerin) and Corynebacterium parvum are potentially useful adjuvants. 

10 Monoclonal antibodies to telomerase or telomerase protein subunits be prepared 

using any technique which provides for the production of antibody molecules by 
continuous cell lines in culture. These include but are not limited to the hybridoma 
technique originally described by Koehler and Milstein (Koehler and Milstein, Nature 
256:495-497 [1975]), the human B-cell hybridoma technique (Kosbor et al, Immvmol. 

15 Today 4:72 [1983]; Cote et al, Proc. Natl. Acad. Sci., 80:2026-2030 [1983]) and the 

EBV-hybridoma technique (Cole et al. Monoclonal Antibodies and Cancer Therapy, 
Alan R Liss Inc, New York NY, pp 77-96 [1985]). 

Antibodies may also be produced by inducing in vivo production in the 
lymphocyte population or by screening recombinant immunoglobulin libraries or 

20 panels of highly specific binding reagents as disclosed in Orlandi et al (Orlandi et 

a/., Proc. Natl. Acad. Sci., 86: 3833 [1989]; and Winter and Milstein, Nature 349:293 
[1991]). 

Antibody fragments which contain specific binding sites for telomerase or 
telomerase protein subunits may also be generated. For example, such fragments 
25 include, but are not limited to, the F(ab')2 fragments which can be produced by pepsin 

digestion of the antibody molecule and the Fab fragments which can be generated by 
reducing the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression 
libraries may be constructed to allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity (Huse et al. Science 256:1275 [1989]). 



- 48 - 




A variety of protocols for competitive binding or immvmoradiometric assays 
using either polyclonal or monoclonal antibodies with established specificities are well 
known in the art. Such immunoassays typically involve the formation of complexes 
between telomerase or telomerase protein subunit and its specific antibody and the 
5 measurement of complex formation. A two-site, monoclonal-based immunoassay 

utilizing monoclonal antibodies reactive to two noninterfering epitopes on a specific 
telomerase protein subunit is preferred in some situations, but a competitive binding 
assay may also be employed {See e.g., Maddox et al, J. Exp. Med., 158:1211 [1983]). 
Peptides selected from the group comprising the sequences shown in Figure 32 

10 are used to generate polyclonal and monoclonal antibodies specifically directed against 
human and other telomerase proteins. The peptides are useful for inhibition of 
protein-RNA, protein-protein interaction within the telomerase complex, and protein- 
DNA interaction at telomeres. Antibodies produced against these peptides are then 
used in various settings, including but not limited to anti-cancer therapeutics capable of 

15 inhibiting telomerase activity, for purification of native telomerase for therapeutics, for 

purification and cloning other components of human telomerase and other proteins 
associated with human telomerase, and diagnostic reagents. 

Diagnostic Assays Using Telomerase Specific Antibodies 

Particular telomerase and telomerase protein subunit antibodies are useful for 
20 the diagnosis of conditions or diseases characterized by expression of telomerase or 

telomerase protein subunits, or in assays to monitor patients being treated with 
telomerase, its fragments, agonists or inhibitors (including antisense transcripts capable 
of reducing expression of telomerase). Diagnostic assays for telomerase include 
methods utilizing the antibody and a label to detect telomerase in human body fluids 
25 or extracts of cells or tissues. The polypeptides and antibodies of the present invention 

may be used with or without modification. Frequently, the polypeptides and 
antibodies will be labeled by joining them, either covalently or noncovalently, with a 
reporter molecule. A wide variety of reporter molecules are knovm, several of which 
were described above. In particular, the present invention is useful for diagnosis of 
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human disease, although it is contemplated that the present invention will find use in 
the veterinary arena. 

A variety of protocols for measuring telomerase protein(s) using either 
polyclonal or monoclonal antibodies specific for the respective protein are known in 
5 the art. Examples include enzyme-linked immunosorbent assay (ELISA), 

radioimmunoassay (RIA) and fluorescent activated cell sorting (FACS). A two-site, 
monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two 
non-interfering epitopes on the telomerase proteins or a subunit is preferred, but a 
competitive binding assay may be employed. These assays are described, among other 

10 places, in Maddox (Maddox et al, J. Exp. Med., 158:1211 [1983]). 

In order to provide a basis for diagnosis, normal or standard values for human 
telomerase expression must be established. This is accomplished by combining body 
fluids or cell extracts taken from normal subjects, either animal or human, with 
antibody to telomerase or telomerase subunit(s) under conditions suitable for complex 

15 formation which are well known in the art. The amount of standard complex 

formation may be quantified by comparing various artificial membranes containing 
known quantities of telomerase protein, with both control and disease samples from 
biopsied tissues. Then, standard values obtained from normal samples may be 
compared with values obtained from samples from subjects potentially affected by 

20 disease {e.g., metastases). Deviation between standard and subject values establishes 

the presence of a disease state. 

Drug Screening 

Telomerase or telomerase subunit proteins or their catalytic or immunogenic 
fragments or oligopeptides thereof, can be used for screening therapeutic compounds in 
25 any of a variety of drug screening techniques. The fragment employed in such a test 

may be free in solution, affixed to a solid support, borne on a cell surface, or located 
intracellularly. The formation of binding complexes, between telomerase or the 
subunit protein and the agent being tested, may be measured. 
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Another technique for drug screening which may be used for high throughput 
screening of compounds having suitable binding affinity to the telomerase or 
telomerase protein subunit is described in detail in "Determination of Amino Acid 
Sequence Antigenicity" by Geysen, (Geysen, WO Application 84/03564, published on 
5 September 13, 1984, incorporated herein by reference). In summary, large numbers 

of different small peptide test compounds are synthesized on a solid substrate, such as 
plastic pins or some other surface. The peptide test compounds are reacted with 
fragments of telomerase or telomerase protein subunits and washed. Bound telomerase 
or telomerase protein subunit is then detected by methods well known in the art. 

10 Substantially purified telomerase or telomerase protein subunit can also be coated 

directly onto plates for use in the aforementioned drug screenuig techniques. 
Alternatively, non-neutralizing antibodies can be used to capture the peptide and 
immobilize it on a solid support. 

This invention also contemplates the use of competitive drug screening assays 

15 in which neutralizing antibodies capable of binding telomerase or subunit protein(s) 

specifically compete with a test compound for binding telomerase or the subunit 
protein. In this manner, the antibodies can be used to detect the presence of any 
peptide which shares one or more antigenic determinants with the telomerase or 
subunit protein. 

20 Uses of the Polynucleotides Encoding Telomerase Subunit Proteins 

A polynucleotide sequence encoding telomerase subvinit proteins or any part 
thereof may be used for diagnostic and/or therapeutic purposes. For diagnostic 
purposes, the sequence encoding telomerase subunit protein of this invention may be 
used to detect and quantitate gene expression of the telomerase or subunit protein. 
25 The diagnostic assay is useful to distinguish between absence, presence, and excess 

expression of telomerase, and to monitor regulation of telomerase levels during 
therapeutic intervention. Included in the scope of the invention are oligonucleotide 
sequences, antisense RNA and DNA molecules, and PNAs. 
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Another aspect of the subject invention is to provide for hybridization or PGR 
probes which are capable of detecting polynucleotide sequences, including genomic 
sequences, encoding telomerase subunit proteins or closely related molecules. The 
specificity of the probe, whether it is made from a highly specific region (e.g., 10 
5 unique nucleotides in the 5' regulatory region), or a less specific region (e.g., 

especially in the 3' region), and the stringency of the hybridization or amplification 
(maximal, high, intermediate or low) will determine whether the probe identifies only 
naturally occurring telomerase, telomerase subunit proteins or related sequences. 

Probes may also be used for the detection of related sequences and should 
10 preferably contain at least 50% of the nucleotides from any of these telomerase subunit 
protein sequences. The hybridization probes of the subject invention may be derived 
from the nucleotide sequence provided by the present invention (e.g., SEQ ID N0:1, 
3, 62, 66, or 69), or from genomic sequence including promoter, enhancer elements 
and introns of the naturally occurring sequence encoding telomerase subunit proteins. 
15 Hybridization probes may be labeled by a variety of reporter groups, including 

commercially available radionuclides such as ^^P or ^^S, or enzymatic labels such as 
alkaline phosphatase coupled to the probe via avidin^iotin coupling systems, and the 
like. 

Other means for producing specific hybridization probes for DNAs include the 
20 cloning of nucleic acid sequences encoding telomerase subunit proteins or derivatives 

into vectors for the production of mRNA probes. Such vectors are known in the art 
and are commercially available and may be used to synthesize RNA probes in vitro by 
means of the addition of the appropriate RNA polymerase as T7 or SP6 RNA 
polymerase and the appropriate radioactively labeled nucleotides. 

25 Diagnostic Use 

Polynucleotide sequences encoding telomerase may be used for the diagnosis of 
conditions or diseases with which the abnormal expression of telomerase is associated. 
For example, polynucleotide sequences encoding human telomerase may be used in 
hybridization or PGR assays of fluids or tissues from biopsies to detect telomerase 




expression. The form of such quaUtative or quantitative methods may include 
Southern or northern analysis, dot blot or other membrane-based technologies; PGR 
technologies; dip stick, pin, chip and ELISA technologies. All of these techniques are 
well known in the art and are the basis of many commercially available diagnostic 
5 kits. 

The human telomerase-encoding nucleotide sequences disclosed herein provide 
the basis for assays that detect activation or induction associated with disease 
(including metastasis); in addition, the lack of expression of human telomerase may be 
detected using the human and other telomerase-encoding nucleotide sequences 

10 disclosed herein. The nucleotide sequence may be labeled by methods known in the 
art and added to a fluid or tissue sample from a patient under conditions suitable for 
the formation of hybridization complexes. After an incubation period, the sample is 
washed with a compatible fluid which optionally contains a dye (or other label 
requiring a developer) if the nucleotide has been labeled with an enzyme. After the 

15 compatible fluid is rinsed off, the dye is quantitated and compared with a standard. If 

the amount of dye in the biopsied or extracted sample is significantly elevated over 
that of a comparable control sample, the nucleotide sequence has hybridized with 
nucleotide sequences in the sample, and the presence of elevated levels of nucleotide 
sequences encoding human telomerase in the sample indicates the presence of the 

20 associated disease. Alternatively, the loss of expression of human telomerase 

sequences in a tissue which normally expresses telomerase sequences indicates the 
presence of an abnormal or disease state. 

Such assays may also be used to evaluate the efficacy of a particular 
therapeutic treatment regime in animal studies, in clinical trials, or in monitoring the 

25 treatment of an individual patient. In order to provide a basis for the diagnosis of 

disease, a normal or standard profile for human telomerase expression must be 
established. This is accomplished by combining body fluids or cell extracts taken 
from normal subjects, either animal or human, with human telomerase or a portion 
thereof, under conditions suitable for hybridization or amplification. Standard 

30 hybridization may be quantified by comparing the values obtained for normal subjects 




with a dilution series of human telomerase run in the same experiment where a known 
amount of substantially purified human telomerase is used. Standard values obtained 
from normal samples may be compared with values obtained from samples from 
patients affected by telomerase-associated diseases. Deviation between standard and 
5 subject values establishes the presence of disease. 

Once disease is established, a therapeutic agent is administered and a treatment 
profile is generated. Such assays may be repeated on a regular basis to evaluate 
whether the values in the profile progress toward or return to the normal or standard 
pattern. Successive treatment profiles may be used to show the efficacy of treatment 

10 over a period of several days or several months. 

PGR, which may be used as described in US Patent Nos. 4,683,195, 4,683,202, 
and 4,965,188 (herein incorporated by reference) provides additional uses for 
oligonucleotides based upon the sequence encoding telomerase subunit proteins. Such 
oligomers are generally chemically synthesized, but they may be generated 

15 enzymatically or produced from a recombinant source. Oligomers generally comprise 

two nucleotide sequences, one with sense orientation (5'-»3') and one with antisense 
(3'-^ 5'), employed under optimized conditions for identification of a specific gene or 
condition. The same two oligomers, nested sets of oligomers, or even a degenerate 
pool of oligomers may be employed under less stringent conditions for detection 

20 and/or quantitation of closely related DNA or RNA sequences. 

Additionally, methods which may be used to quantitate the expression of a 
particular molecule include radiolabeling (Melby et al, J. Immunol. Meth., 159:235-44 
[1993]) or biotinylating [Duplaa et al, Anal. Biochem., 229-36 [1993]) nucleotides, 
co-amplification of a control nucleic acid, and standard curves onto which the 

25 experimental results are interpolated. Quantitation of multiple samples may be 

speeded up by ruiming the assay in an ELISA format where the oligomer of interest is 
presented in various dilutions and a spectrophotometric or colorimetric response gives 
rapid quantitation. A definitive diagnosis of this type may allow health professionals 
to begin aggressive treatment and prevent further worsening of the condition. 

30 Similarly, further assays can be used to monitor the progress of a patient during 




treatment. Furthermore, the nucleotide sequences disclosed herein may be used in 
molecular biology techniques that have not yet been developed, provided the new 
techniques rely on properties of nucleotide sequences that are currently known such as 
the triplet genetic code, specific base pair interactions, and the like. 

5 Therapeutic Use 

Based upon its homology to other telomerase sequences, the polynucleotide 
encoding human telomerase disclosed herein may be useful in the treatment of 
metastasis; in particular, inhibition of human telomerase expression may be therapeutic. 
Expression vectors derived from retroviruses, adenovirus, herpes or vaccinia 
10 viruses, or from various bacterial plasmids, may be used for delivery of nucleotide 

sequences (sense or antisense) to the targeted organ, tissue or cell population. Methods 
which are well known to those skilled in the art can be used to construct recombinant 
vectors which will express antisense of the sequence encoding hviman telomerase. See, 
for example, the techniques described in Sambrook et al. {supra) and Ausubel et al. 
15 {supra). 

The polynucleotides comprising full length cDNA sequence and/or its 
regulatory elements enable researchers to use the sequence encoding human 
telomerase, including the various motifs as an investigative tool in sense (Youssoufian 
and Lodish, Mol. Cell. Biol., 13:98-104 [1993]) or antisense (Eguchi et al, Ann. Rev. 

20 Biochem., 60:631-652 [1991]) regulation of gene function. Such technology is now 

well known in the art, and sense or antisense oligomers, or larger fragments, can be 
designed from various locations along the coding or control regions. 

Genes encoding human telomerase can be turned off by transfecting a cell or 
tissue with expression vectors which express high levels of a desired telomerase 

25 fragment. Such constructs can flood cells with untranslatable sense or antisense 

sequences. Even in the absence of integration into the DNA, such vectors may 
continue to transcribe RNA molecules until all copies are disabled by endogenous 
nucleases. Transient expression may last for a month or more with a non-replicating 
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vector and even longer if appropriate replication elements are part of the vector 
system. 

As mentioned above, modifications of gene expression can be obtained by 
designing antisense molecules, DNA, RNA or PNA, to the control regions of the 
5 sequence encoding human telomerase (i.e., the promoters, enhancers, and introns). 

Oligonucleotides derived from the transcription initiation site, (e.g., between -10 and 
+10 regions of the leader sequence) are preferred. The antisense molecules may also 
be designed to block translation of mRNA by preventing the transcript from binding to 
ribosomes. Similarly, inhibition can be achieved using "triple helix" base-pairing 

10 methodology. Triple helix pairing compromises the ability of the double helix to open 
sufficiently for the binding of polymerases, transcription factors, or regulatory 
molecules (for a review of recent therapeutic advances using triplex DNA, see Gee et 
al, in Huber and Carr, Molecular and Immunologic Approaches, Futura Publishing 
Co, MtKisco NY [1994]). 

15 Ribozymes are enzymatic RNA molecules capable of catalyzing the specific 

cleavage of RNA. The mechanism of ribozyme action involves sequence-specific 
hybridization of the ribozyme molecule to complementary target RNA, followed by 
endonucleolytic cleavage. Within the scope of the invention are engineered 
hammerhead motif ribozyme molecules that can specifically and efficiently catalyze 

20 endonucleolytic cleavage of the sequence encoding human telomerase. 

Specific ribozyme cleavage sites within any potential RNA target are initially 
identified by scanning the target molecule for ribozyme cleavage sites which include 
the following sequences, GUA, GUU and GUC. Once identified, short RNA 
sequences of between 15 and 20 ribonucleotides corresponding to the region of the 

25 target gene containing the cleavage site may be evaluated for secondary structural 

features which may render the oligonucleotide inoperable. The suitability of candidate 
targets may also be evaluated by testing accessibility to hybridization with 
complementary oligonucleotides using ribonuclease protection assays. 

Antisense molecules and ribozymes of the invention may be prepared by any 

30 method known in the art for the synthesis of RNA molecules. These include 



techniques for chemically synthesizing oligonucleotides such as solid phase 
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated 
by in vitro and in vivo transcription of DNA sequences encoding human telomerase 
and/or telomerase protein subunits. Such DNA sequences may be incorporated into a 
5 wide variety of vectors with suitable RNA polymerase promoters such as T7 or SP6. 

Alternatively, antisense cDNA constructs that synthesize antisense RNA constitutively 
or inducibly can be introduced into cell lines, cells or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. 
Possible modifications include, but are not limited to, the addition of flanking 

10 sequences at the 5' and/or 3' ends of the molecule or the use of phosphorothioate or 2' 
0-methyl rather than phosphodiesterase linkages within the backbone of the molecule. 
This concept is inherent in the production of PNAs and can be extended in all of these 
molecules by the inclusion of nontraditional bases such as inosine, queosine and 
wybutosine as well as acetyl-, methyl-, thio- and similarly modified forms of adenine, 

15 cytidine, guanine, thymine, and uridine which are not as easily recognized by 
endogenous endonucleases. 

Methods for introducing vectors into cells or tissues include those methods 
discussed infra, and which are equally suitable for in vivo, in vitro and ex vivo therapy. 
For ex vivo therapy, vectors are introduced into stem cells taken from the patient and 

20 clonally propagated for autologous transplant back into that same patient is presented 
in US Patent Nos. 5,399,493 and 5,437,994, the disclosure of which is herein 
incorporated by reference. Delivery by transfection and by liposome are quite well 
known in the art. 

Furthermore, the nucleotide sequences encoding the various telomerase proteins 
25 and subunits disclosed herein may be used in molecular biology techniques that have 

not yet been developed, provided the new techniques rely on properties of nucleotide 
sequences that are currently known, including but not limited to such properties as the 
triplet genetic code and specific base pair interactions. 
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Detection and Mapping of Related Polynucleotide Sequences 
in Other Genomes 

The nucleic acid sequence encoding E. aediculatus, S. cerevisiae, S. pombe, and 
human telomerase subunit proteins and sequence variants thereof, may also be used to 
5 generate hybridization probes for mapping the naturally occurring homologous 

genomic sequence in the human and other genomes. The sequence may be mapped to 
a particular chromosome or to a specific region of the chromosome using well known 
techniques. These include in situ hybridization to chromosomal spreads, flow-sorted 
chromosomal preparations, or artificial chromosome constructions such as yeast 

10 artificial chromosomes, bacterial artificial chromosomes, bacterial PI constructions or 
single chromosome cDNA libraries as reviewed by Price (Price, Blood Rev., 7:127 
[1993]) and Trask (Trask, Trends Genet 7:149 [1991]). 

The technique of fluorescent in situ hybridization (FISH) of chromosome 
spreads has been described, among other places, in Verma et al. (Verma et al, Human 

15 Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York NY 

[1988]). Fluorescent in situ hybridization of chromosomal preparations and other 
physical chromosome mapping techniques may be correlated with additional genetic 
map data. Examples of genetic map data can be found in the 1994 Genome Issue of 
Science (265: 198 If). Correlation between the location of the sequence encoding 

20 human telomerase on a physical chromosomal map and a specific disease (or 

predisposition to a specific disease) may help delimit the region of DNA associated 
with the disease. The nucleotide sequences of the subject invention may be used to 
detect differences in gene sequences between normal, carrier or affected individuals. 
In situ hybridization of chromosomal preparations and physical mapping 

25 techniques such as linkage analysis using established chromosomal markers are 

invaluable in extending genetic maps {See e.g., Hudson et al. Science 270:1945 
[1995]). Often the placement of a gene on the chromosome of another mammalian 
species such as mouse (Whitehead Institute/MIT Center for Genome Research, Genetic 
Map of the Mouse, Database Release 10, April 28, 1995) may reveal associated 
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markers even if the number or arm of a particular human chromosome is not known. 
New sequences can be assigned to chromosomal arms, or parts thereof, by physical 
mapping. This provides valuable information to investigators searching for disease 
genes using positional cloning or other gene discovery techniques. 

5 Pharmaceutical Compositions 

The present invention also relates to pharmaceutical compositions which may 
comprise telomerase and/or or telomerase subunit nucleotides, proteins, antibodies, 
agonists, antagonists, or inhibitors, alone or in combination with at least one other 
agent, such as stabilizing compovmd, which may be administered in any sterile, 

10 biocompatible pharmaceutical carrier, including, but not limited to, saline, buffered 

saline, dextrose, and water. Any of these molecules can be administered to a patient 
alone, or in combination with other agents, drugs or hormones, in pharmaceutical 
compositions where it is mixed with suitable excipient(s), adjuvants, and/or 
pharmaceutically acceptable carriers. In one embodiment of the present invention, the 

15 pharmaceutically acceptable carrier is pharmaceutically inert. 

Administration Of Pharmaceutical Compositions 

Administration of pharmaceutical compositions is accomplished orally or 
parenterally. Methods of parenteral delivery include topical, intra-arterial {e.g., 
directly to the tumor), intramuscular, subcutaneous, intramedullary, intrathecal, 

20 intraventricular, intravenous, intraperitoneal, or intranasal administration. In addition 

to the active ingredients, these pharmaceutical compositions may contain suitable 
pharmaceutically acceptable carriers comprising excipients and other compounds that 
facilitate processing of the active compounds into preparations which can be used 
pharmaceutically. Further details on techniques for formulation and administration 

25 may be found in the latest edition of "Remington's Pharmaceutical Sciences" (Maack 

Publishing Co, Easton PA). 

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral 
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administration. Such carriers enable the pharmaceutical compositions to be formulated 
as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, etc., 
suitable for ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through combination 
5 of active compounds with solid excipient, optionally grinding a resulting mixture, and 

processing the mixture of granules, after adding suitable additional compounds, if 
desired, to obtain tablets or dragee cores. Suitable excipients are carbohydrate or 
protein fillers include, but are not limited to sugars, including lactose, sucrose, 
mannitol, or sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose 

10 such as methyl cellulose, hydroxypropylmethyl-cellulose, or sodium 

carboxymethylcellulose; and gums including arabic and tragacanth; as well as proteins 
such as gelatin and collagen. If desired, disintegrating or solubilizing agents may be 
added, such as the cross-linked polyvinyl pyrrolidone, agar, alginic acid, or a salt 
thereof, such as sodium alginate. 

15 Dragee cores are provided with suitable coatings such as concentrated sugar 

solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, 
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic 
solvents or solvent mixtures. Dyestuffs or pigments may be added to the tablets or 
dragee coatings for product identification or to characterize the quantity of active 

20 compound (i.e., dosage). 

Pharmaceutical preparations which can be used orally include push-fit capsules 
made of gelatin, as well as soft, sealed capsules made of gelatin and a coating such as 
glycerol or sorbitol. Push-fit capsules can contain active ingredients mixed with a 
filler or binders such as lactose or starches, lubricants such as talc or magnesium 

25 stearate, and, optionally, stabilizers. In soft capsules, the active compounds may be 

dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid 
polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations for parenteral administration include aqueous 
solutions of active compounds. For injection, the pharmaceutical compositions of the 

30 invention may be formulated in aqueous solutions, preferably in physiologically 
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compatible buffers such as Hanks' s solution, Ringer's solution, or physiologically 
buffered saline. Aqueous injection suspensions may contain substances which increase 
the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Additionally, suspensions of the active compounds may be prepared as 
5 appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include 

fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or 
triglycerides, or liposomes. Optionally, the suspension may also contain suitable 
stabilizers or agents which increase the solubility of the compounds to allow for the 
preparation of highly concentrated solutions. 
10 For topical or nasal administration, penetrants appropriate to the particular 

barrier to be permeated are used in the formulation. Such penetrants are generally 
known in the art. 

Manufacture And Storage 

The pharmaceutical compositions of the present invention may be manufactured 
15 in a manner that known in the art (e.g., by means of conventional mixing, dissolving, 

granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or 

lyophilizing processes). 

The pharmaceutical composition may be provided as a salt and can be formed 

with many acids, including but not limited to hydrochloric, sulfuric, acetic, lactic, 
20 tartaric, malic, succinic, etc. Salts tend to be more soluble in aqueous or other 

protonic solvents that are the corresponding free base forms. In other cases, the 

preferred preparation may be a lyophilized powder in 1 mM-50 mM histidine, 

0.1%-2% sucrose, 2%-7% mannitol at a pH range of 4.5 to 5.5, that is combined with 

buffer prior to use. 

25 After pharmaceutical compositions comprising a compound of the invention 

formulated in a acceptable carrier have been prepared, they can be placed in an 
appropriate container and labeled for treatment of an indicated condition. For 
administration of human telomerase proteins, such labeling would include amoxmt, 
frequency and method of administration. 
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Therapeutically Effective Dose 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to 
achieve the intended purpose. The determination of an effective dose is well within 
5 the capability of those skilled in the art. 

For any compound, the therapeutically effective dose can be estimated initially 
either in cell culture assays or in any appropriate animal model. The animal model is 
also used to achieve a desirable concentration range and route of administration. Such 
information can then be used to determine useful doses and routes for administration in 

10 humans. 

A therapeutically effective dose refers to that amount of protein or its 
antibodies, antagonists, or inhibitors which ameliorate the symptoms or condition. 
Therapeutic efficacy and toxicity of such compounds can be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals (e.g., EDjq, the 

15 dose therapeutically effective in 50% of the population; and LD50, the dose lethal to 

50% of the population). The dose ratio between therapeutic and toxic effects is the 
therapeutic index, and it can be expressed as the ratio, LD^/ED^q. Pharmaceutical 
compositions which exhibit large therapeutic indices are preferred. The data obtained 
from cell culture assays and animal studies is used in formulating a range of dosage 

20 for human use. The dosage of such compounds lies preferably within a range of 

circulating concentrations that include the ED50 with little or no toxicity. The dosage 
varies within this range depending upon the dosage form employed, sensitivity of the 
patient, and the route of administration. 

The exact dosage is chosen by the individual physician in view of the patient to 

25 be treated. Dosage and administration are adjusted to provide sufficient levels of the 

active moiety or to maintain the desired effect. Additional factors which may be taken 
into account include the severity of the disease state (e.g., tumor size and location; age, 
weight and gender of the patient; diet, time and frequency of administration, drug 
combination(s), reaction sensitivities, and tolerance/response to therapy). Long acting 

30 pharmaceutical compositions might be admmistered every 3 to 4 days, every week, or 
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once every two weeks depending on half-life and clearance rate of the particular 
formulation. Guidance as to particular dosages and methods of delivery is provided in 
the literature {See, US Patent Nos. 4,657,760; 5,206,344; and 5,225,212, herein 
incorporated by reference). Those skilled in the art will employ different formulations 
5 for nucleotides than for proteins or their inhibitors. Similarly, delivery of 

polynucleotides or polypeptides will be specific to particular cells, conditions, 
locations, etc. 

It is contemplated, for example, that human telomerase can be used as a 
therapeutic molecule combat disease {e.g., cancer) and/or problems associated with 

10 aging. It is further contemplated that antisense molecules capable of reducing the 

expression of human telomerase or telomerase protein subunits can be as therapeutic 
molecules to treat tumors associated with the aberrant expression of human telomerase. 
Still further it is contemplated that antibodies directed against human telomerase and 
capable of neutralizing the biological activity of human telomerase may be used as 

1 5 therapeutic molecules to treat tumors associated with the aberrant expression of human 

telomerase and/or telomerase protein subunits. 



EXPERIMENTAL 

The following examples are provided in order to demonstrate and further 
illustrate certain preferred embodiments and aspects of the present invention and are 

20 not to be construed as limiting the scope thereof. 

In the experimental disclosure which follows, the following abbreviations 
apply: eq (equivalents); M (Molar); |iM (micromolar); N (Normal); mol (moles); 
mmol (millimoles); ^imol (micromoles); nmol (nanomoles); g (grams); mg 
(milligrams); jxg (micrograms); ng (nanograms); 1 or L (liters); ml (milliliters); p,l 

25 (microliters); cm (centimeters); mm (millimeters); \xm (micrometers); nm 

(nanometers); °C (degrees Centigrade); RPN (ribonucleoprotein); remN (2'-0- 
methylribonucleotides); dNTP (deoxyribonucleotide); dHjO (distilled water); DDT 
(dithiothreitol); PMSF (phenylmethylsulfonyl fluoride); TE (10 mM Tris HCl, 1 mM 

- 63 - 




EDTA, approximately pH 7.2); KGlu (potassium glutamate); SSC (salt and sodium 
citrate buffer); SDS (sodivim dodecyl sulfate); PAGE (polyacrylamide gel 
electrophoresis); Novex (Novex, San Diego, CA); BioRad (Bio-Rad Laboratories, 
Hercules, CA); Pharmacia (Pharmacia Biotech, Piscataway, NJ); Boehringer-Mannheim 
5 (Boehringer-Mannheim Corp., Concord, CA); Amersham (Amersham, Inc., Chicago, 

IL); Stratagene (Stratagene Cloning Systems, La JoUa, CA); NEB (New England 
Biolabs, Beverly, MA); Pierce (Pierce Chemical Co., Rockford, IL); Beckman 
(Beckman Instruments, FuUerton, CA); Lab Industries (Lab Industries, Inc., Berkeley, 
CA); Eppendorf (Eppendorf Scientific, Madison, WI); and Molecular Dynamics 
10 (Molecular Dynamics, Suimyvale, CA). 

EXAMPLE 1 
Growth of Euplotes aediculatus 

In this Example, cultures of E. aediculatus were obtained from Dr. David 
Prescott, MCDB, University of Colorado. Dr. Prescott originally isolated this culture 
15 from pond water, although this organism is also available from the ATCC (ATCC 

#30859). Cultures were grown as described by Swanton et al, (Swanton et al.,, 
Chromosoma 77:203 [1980]), under non-sterile conditions, in 15-liter glass containers 
containing Chlorogonium as a food source. Organisms were harvested from the 
cultures when the density reached approximately 10"* cells/ml. 

20 EXAMPLE 2 

Preparation of Nuclear Extracts 

In this Example, nuclear extracts of E. aediculatus were prepared using the 
method of Lingner et al, (Lingner et al, Genes Develop., 8:1984 [1994]), with minor 
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modifications, as indicated below. Briefly, cells grown as described in Example 1 
were concentrated with 15 |am Nytex filters and cooled on ice. The cell pellet was 
resuspended in a final volume of 110 ml TMS/PMSF/spermidinephosphate buffer. 
The stock TMS/PMSF/spermidine phosphate biiffer was prepared by adding 0.075 g 
5 spermidine phosphate (USB) and 0.75 ml PMSF (from 100 mM stock prepared in 

ethanol) to 150 ml TMS. TMS comprised 10 mM Tris-acetate, 10 mM MgClj, 
85.5752 g sucrose/liter, and 0.33297 g CaCyiiter, pH 7.5. 

After resuspension in TMS/PMSF/spermidinephosphate buffer, 8.8 ml 10% NP- 
40 and 94. 1 g sucrose were added and the mixture placed in a siliconized glass beaker 

10 with a stainless steel stirring rod attached to an overhead motor. The mixture was 

stirred until the cells were completely lysed (approximately 20 minutes). The mixture 
was then centrifuged for 10 minutes at 7500 rpm (8950 x g), at 4°C, using a Beckman 
JS-13 swing-out rotor. The supernatant was removed and nuclei pellet was 
resuspended in TMS/PMSF/spermidine phosphate buffer, and centrifuged again, for 5 

15 minutes at 7500 rpm (8950 x g), at 4°C, using a Beckman JS-13 swing-out rotor. 

The supernatant was removed and the nuclei pellet was resuspended in a buffer 
comprised of 50 mM Tris-acetate, 10 mM MgCl2, 10% glycerol, 0.1% NP-40, 0.4 M 
KGlu, 0.5 mM PMSF, pH 7.5, at a volume of 0.5 ml buffer per 10 g of harvested 
cells. The resuspended nuclei were then domced in a glass homogenizer with 

20 approximately 50 strokes, and then centrifuged for 25 minutes at 14,000 rpm at 4°C, 

in an Eppendorf centrifuge. The supernatant containing the nuclear extract was 
collected, frozen in liquid nitrogen, and stored at -80°C until used. 

EXAMPLE 3 
Purification of Telomerase 

25 In this Example, nuclear extracts prepared as described in Example 2 were used 

to purify E. aediculatus telomerase. In this purification protocol, telomerase was first 
enriched by chromatography on an Affi-Gel-heparin column, and then extensively 
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purified by affinity purification with an antisense oligonucleotide. As the template 
region of telomerase RNA is accessible to hybridization in the telomerase RNP 
particle, an antisense oligonucleotide (i.e., the "affinity oligonucleotide") was 
synthesized that was complementary to this template region as an affinity bait for the 
5 telomerase. A biotin residue was included at the 5' end of the oligonucleotide to 

immobilize it to an avidin column. 

Following the binding of the telomerase to the oligonucleotide, and extensive 
washing, the telomerase was eluted by use of a displacement oligonucleotide. The 
affinity oligonucleotide included DNA bases that were not complementary to the 

10 telomerase RNA 5' to the telomerase-specific sequence. As the displacement 

oligonucleotide was complementary to the affinity oligonucleotide for its entire length, 
it was able to form a more thermodynamically stable duplex than the telomerase bound 
to the affinity oligonucleotide. Thus, addition of the displacement oligonucleotide 
resulted in the elution of the telomerase fi-om the column. 

15 In this Example, the nuclear extracts prepared from 45 liter cultures were 

frozen until a total of 34 ml of nuclear extract was collected. This corresponded to 
630 liters of culture (i.e., approximately 4 x 10^ cells). The nuclear extract was 
diluted with a buffer to 410 ml, to provide final concentrations of 20 mM Tris-acetate, 
1 mM MgCl2, 0.1 mM EDTA, 33 mM KGlu, 10% (vol/vol) glycerol, 1 mM 

20 dithiothreitol (DTT), and 0.5 mM phenylmethylsulfonyl fluoride (PMSF), at a pH of 

7.5. 

The diluted nuclear extract was applied to an Affi-Gel-heparin gel column 
(Bio-Rad), with a 230 ml bed volume and 5 cm diameter, equilibrated in the same 
buffer and eluted with a 2-liter gradient from 33 to 450 mM KGlu. The column was 

25 run at 4°C, at a flow rate of 1 column volume/hour. Fractions of 50 mis each were 

collected and assayed for telomerase activity as described in Example 4. Telomerase 
was eluted from the column at approximately 170 mM KGlu. Fractions containing 
telomerase (approximately 440 ml) were pooled and adjusted to 20 mM Tris-acetate, 
10 mM MgClj, 1 mM EDTA, 300 mM KGlu, 10% glycerol, 1 mM DTT, and 1% 

30 Nonidet P-40. This buffer was designated as "WB." 




To this preparation, 1.5 nmol of each of two competitor DNA oligonucleotides 
(5'-TAGACCTGTTAGTGTACATTTGAATTGAAGC-3' (SEQ ID NO:28)) and (5'- 
TAGACCTGTTAGGTTGGATTTGTGGCATCA-3' (SEQ ID NO:29)), 50 [xg yeast 
RNA (Sigma), and 0.3 nmol of biotin-labelled telomerase- specific oligonucleotide (5'- 
5 biotin-TAGACCTGTTA-(rmeG)2-(rmeU)4-(rmeG)4-(rmeU)4-remG-3')(SEQ ID NO:60), 

were added per ml of the pool. The 2-(9-methyribonucleotides of the telomerase 
specific oligonucleotides were complementary to the telomerase RNA template region; 
the deoxyribonucleotides were not complementary. The inclusion of competitor, non- 
specific DNA oligonucleotides increased the efficiency of the purification, as the 

10 effects of nucleic acid binding proteins and other components in the mixture that 

would either bind to the affinity oligonucleotide or remove the telomerase from the 
mixture were minimized. 

This material was then added to Ultralink immobilized neutravidin plus (Pierce) 
colunm material, at a volume of 60 \d of suspension per ml of pool. The column 

15 material was pre-blocked twice for 15 minutes each blocking, with a preparation of 

WB containing 0.01% Nonidet P-40, 0.5 mg BSA, 0.5 mg/ml lysozyme, 0.05 mg/ml 
glycogen, and 0.1 mg/ml yeast RNA. The blocking was conducted at 4°C, using a 
rotating wheel to thoroughly block the column material. After the first blocking step, 
and before the second blocking step, the column material was centrifuged at 200 x g 

20 for 2 minutes to pellet the matrix. 

The pool-column mixture was incubated for 8 minutes at 30°C, and then for an 
additional 2 hours at 4°C, on a rotating wheel (approximately 10 rpm; Labindustries) 
to allow binding. The pool-column mixture was then centrifuged 200 xg for 2 
minutes, and the supernatant containing unbound material was removed. The pool- 

25 column mixture was then washed. This washing process included the steps of rinsing 
the pool-column mixture with WB at 4°C, washing the mixture for 15 minutes with 
WB at 4°C, rinsing with WB, washing for 5 minutes at 30°C, with WB containing 0.6 
M KGlu, and no Nonidet P-40, washing 5 minutes at 25°C with WB, and finally, 
rinsing again with WB. The volume remaining after the final wash was kept small, in 

30 order to yield a ratio of buffer to column material of approximately 1:1. 
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Telomerase was eluted from the column material by adding 1 nmol of 
displacement deoxyoligonucleotide (5'-CA4C4A4C2TA2CAG2TCTA-3')(SEQ ID 
NO:30), per ml of column material and incubating at 25°C for 30 minutes. The 
material was centrifuged for 2 minutes 14,000 rpm in a microcentrifuge (Eppendorf), 
5 and the eluate collected. The elution procedure was repeated twice more, using fresh 
displacement oligonucleotide each time. As mentioned above, because the 
displacement oligonucleotide was complementary to the affinity oligonucleotide, it 
formed a more thermodynamically stable complex with the affinity oligonucleotide 
than the telomerase. Thus, addition of the displacement oligonucleotide to an affmity- 

10 bound telomerase resulted in efficient elution of telomerase under native conditions. 

The telomerase appeared to be approximately 50% pure at this stage, as judged by 
analysis on a protein gel. The affinity purification of telomerase and elution with a 
displacement oligonucleotide is shown in Figure 1 (panels A and B, respectively). In 
this Figure, the 2'-0-methyl sugars of the affinity oligonucleotide are indicated by the 

15 bold line. The black and shaded oval shapes in this Figure are intended to graphically 

represent the protein subunits of the present invention. 

The protein concentrations of the extract and material obtained following Affi- 
Gel-heparin column chromatography, were determined using the method of Bradford 
(Bradford, Anal. Biochem., 72:248 [1976]), using BSA as the standards. Only a 

20 fraction of the telomerase preparation was further purified on a glycerol gradient. 

The sedimentation coefficient of telomerase was determined by glycerol 
gradient centrifugation, as described in Example 8. 

Table 1 below is a purification table for telomerase purified according to the 
methods of this Example. The telomerase was enriched 12-fold in nuclear extracts, as 

25 compared to whole cell extracts, with a recovery of 80%; 85% of telomerase was 

solubilized from nuclei upon extraction. 
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Table 1. Purification of Telomerase 





Protein (mg) 


Telomerase 
(pmol of 
RNP) 


Telomerase/ 
Protein/pmol 
of RNPMgi 


Recovery 


Puiil'icaliiMi 
Factor 


Nuclear 
Extract 


2020 


1720 


0.9 


100 


1 


Heparin 


125 


1040 


8.3 


60 


10 


Affinity 


0.3** 


680 


2270 


40 


2670 


Glycerol 
Gradient 


NA* 


NA* 


NA* 


25 


NA* 



*NA=Not available 



10 **This value was calculated from the measiired amount of telomerase (680 pmol), by 

assuming a purity of 50% (based on a protein gel). 



EXAMPLE 4 
Telomerase Activity 

At each step in the purification of telomerase, the preparation was analyzed by 
15 three separate assays, one of which was activity, as described in this Example. In 

general, telomerase assays were done in 40 [i\ containing 0.003-0.3 |li1 of nuclear 
extract, 50 mM Tris-Cl (pH 7.5), 50 mM KGlu, 10 mM MgClj, 1 mM DTT, 125 
dTTP, 125 jj,M dGTP, and approximately 0.2 pmoles of 5'-^^P-labelled oligonucleotide 
substrate (i.e., approximately 400,000 cpm). Oligonucleotide primers were heat- 
20 denatured prior to their addition to the reaction mixture. Reactions were assembled on 

ice and incubated for 30 minutes at 25°C. The reactions were stopped by addition of 
200 |il of 10 mM Tris-Cl (pH 7.5), 15 mM EDTA, 0.6% SDS, and 0.05 mg/ml 
proteinase K, and incubated for at least 30 minutes at 45°C. After ethanol 
precipitation, the products were analyzed on denaturing 8% PAGE gels, as known in 
25 the art (See e.g., Sambrook et al, 1989). 
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EXAMPLE 5 
Quantification of Telomerase Activity 

In this Example, quantification of telomerase activity through the purification 
procedure is described. Quantitation was accomplished by assaying the elongation of 
5 oligonucleotide primers in the presence of dGTP and [a -^^P]dTTP. Briefly, 1 \iM 5'- 

(04X4)2-3' oligonucleotide was extended in a 20 |li1 reaction mixture in the presence of 
2 lal of [a -^2p]dTTP (10 mCi/ml, 400 Ci/mmol; I Ci=37 GBq), and 125 [iM dGTP 
as described by (Lingner et al, Genes Develop., 8:1984 [1994]), and loaded onto an 
8% PAGE sequencing gel as known in the art {See e.g., Sambrook et al, 1989). 

10 The results of this study are shown in Figure 3. In lane 1, there is no 

telomerase present {i.e., a negative control); lanes 2, 5, 8, and 11 contained 0.14 fmoi 
telomerase; lanes 3,6,9, and 12 contained 0.42 fmol telomerase; and lanes 4, 7, 10, and 
13 contained 1.3 fmol telomerase. Activity was quantified using a Phosphorlmager 
(Molecular Dynamics) using the manufacturer's instructions. It was determined that 

15 vmder these conditions, 1 fmol of affinity-purified telomerase incorporated 21 fmol of 

dTTP in 30 minutes. 

As shown in this figure, the specific activity of the telomerase did not change 
significantly through the purification procedure. Affinity-purified telomerase was fully 
active. However, it was determined that at high concentrations, an inhibitory activity 

20 was detected and the activity of crude extracts was not linear. Thus, in the assay 

shown in Figure 3, the crude extract was diluted 700-7000-fold. Upon purification, 
this inhibitory activity was removed and no inhibitory effect was detected in the 
purified telomerase preparations, even at high enzyme concentrations. 
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EXAMPLE 6 
Gel Electrophoresis and Northern Blots 

As indicated in Example 4, at each step in the purification of telomerase, the 
preparation was analyzed by three separate assays. This Example describes the gel 
5 electrophoresis and blotting procedures used to quantify telomerase RNA present in 

fractions and analyze the integrity of the telomerase ribonucleoprotein particle. 

Denaturing Gels and Northern Blots 

In this Example, synthetic TT-transcribed telomerase RNA of known 
concentration served as the standard. Throughout this investigation, the RNA 

10 component was used as a measure of telomerase. 

A construct for phage T7 RNA polymerase transcription of E. aediculatus 
telomerase RNA was produced, using the polymerase chain reaction (PCR). The 
telomerase RNA gene was amplified with primers that annealed to either end of the 
gene. The primer that annealed at the 5' end also encoded a hammerhead ribozyme 

15 sequence to generate the natural 5' end upon cleavage of the transcribed RNA, a TV- 

promoter sequence, and an EcoKl site for subcloning. The sequence of this 5' primer 
was 5'- 

GCGGGAATTCTAATACGACTCACTATAGGGAAGAAACTCTGATGAGGCCGAA 
AGGCCGAAACTCCACGAAAGTGGAGTAAGTTTCTCGATAATTGATCTGTAG- 

20 3' (SEQ ID N0:31). The 3' primer included an Earl site for termination of 

transcription at the natural 3' end, and a BamRl site for cloning. The sequence of this 
3' primer was 5'-CGGGGATCCTCTTCAAAAGATGAGAGGACAGCAAAC-3' (SEQ 
ID NO:32). The PCR amplification product was cleaved with EcoRl and BamRl, and 
subcloned into the respective sites of pUC19 (NEB), to give "pEaTV." The correctness 

25 of this insert was confirmed by DNA sequencing. T7 transcription was performed as 
described by Zaug et al, Biochemistry 33:14935 [1994]), with £arl-linearized 
plasmid. RNA was gel-purified and the concentration was determined (an Ajgo of 1 = 



- 71 - 




40 fig/ml). This RNA was used as a standard to determine the telomerase RNA 
present in various preparations of telomerase. 

The signal of hybridization was proportional to the amount of telomerase RNA, 
and the derived RNA concentrations were consistent with, but slightly higher than 
5 those obtained by native gel electrophoresis. Comparison of the amount of whole 

telomerase RNA in whole cell RNA to serial dilutions of known T7 RNA transcript 
concentrations indicated that each E. aediculatus cell contained approximately 300,000 
telomerase molecules. 

Visualization of the telomerase was accomplished by Northern blot 

10 hybridization to its RNA component, using the methods described by Lingner et al. 

(Linger et al. Genes Develop., 8:1984 [1994]). Briefly, RNA (less than or equal to 
0.5 )ag/lane) was resolved on an 8% PAGE and electroblotted onto a Hybond-N 
membrane (Amersham), as known in the art ( See e.g., Sambrook et al, 1989). The 
blot was hybridized overnight in 10 ml of 4x SSC, lOx Denhardt's solution, 0.1% 

15 SDS, and 50 pg/ml denatured herring sperm DNA,. After pre-hybridizing for 3 hours, 

2x10* cpm probe/ml hybridization solution was added. The randomly labelled probe 
was a PCR-product that covered the entire telomerase RNA gene. The blot was 
washed with several buffer changes for 30 minutes in 2x SSC, 0.1% SDS, and then 
washed for 1 hour in 0.1 x SSC and 0.1% SDS at 45°C. 

20 Native Gels and Northern Blots 

In this experiment, the purified telomerase preparation was run on native {le., 
non-denaturing) gels of 3.5%) polyacrylamide and 0.33% agarose, as knovm in the art 
and described by Lamond and Sproat (Lamond and Sproat, [1994], supra). The 
telomerase comigrated approximately with the xylene cyanol dye. 
25 The native gel results indicated that telomerase was maintained as an RNP 

throughout the purification protocol. Figure 2 is a photograph of a Northern blot 
showing the mobility of the telomerase in different fractions on a non-denaturing gel 
as well as in vitro transcribed telomerase. In this figure, lane 1 contained 1.5 fmol 



- 72 - 




telomerase RNA, lane 2 contained 4.6 fmol telomerase RNA, lane 3 contained 14 fmol 
telomerase RNA, lane 4 contained 41 fmol telomerase RNA, lane 5 contained nuclear 
extract (42 fmol telomerase), lane 6 contained Affi-Gel-heparin-purified telomerase (47 
fmol telomerase), lane 7 contained affinity-purified telomerase (68 fmol), and lane 8 
5 contained glycerol gradient-purified telomerase (35 fmol). 

As shown in Figure 2, in nuclear extracts, the telomerase was assembled into 
an RNP particle that migrated slower than unassembled telomerase RNA. Less than 
1% free RNA was detected by this method. However, a slower migrating telomerase 
RNP complex was also sometimes detected in extracts. Upon purification on the Affi- 

10 Gel-heparin column, the telomerase RNP particle did not change in mobility (Figure 2, 

lane 6). However, upon affinity purification the mobility of the RNA particle slightly 
increased (Figure 2, lane 7), perhaps indicating that a protein subunit or fragment had 
been lost. On glycerol gradients, the affinity-purified telomerase did not change in 
size, but approximately 2% free telomerase RNA was detectable (Figure 2, lane 8), 

15 suggesting that a small amount of disassembly of the RNP particle had occurred. 



EXAMPLE 7 
Telomerase Protein Composition 

In this Example, the analysis of the purified telomerase protein composition are 
described. 

20 In this Example, glycerol gradient fractions obtained from Example 8, were 

separated on a 4-20% polyacrylamide gel (Novex). Following electrophoresis, the gel 
was stained with Coomassie brilliant blue. Figure 4 shows a photograph of the gel. 
Lanes 1 and 2 contained molecular mass markers (Pharmacia) as indicated on the left 
side of the gel shown in Figure 4. Lanes 3-5 contained glycerol gradient fraction 

25 pools as indicated on the top of the gel (i.e., lane 3 contained fractions 9-14, lane 4 

contained fractions 15-22, and lane 5 contained fractions 23-32). Lane 4 contained the 
pool with 1 pmol of telomerase RNA. In lanes 6-9 BSA standards were run at 
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concentrations indicated at the top of the gel in Figure 4 (i.e., lane 6 contained 0.5 
pmol BSA, lane 7 contained 1.5 pmol BSA, lane 8 contained 4.5 BSA, and lane 9 
contained 15 pmol BSA). 

As shown in Figure 4, polypeptides with molecular masses of 120 and 43 kDa 
5 co-purified with the telomerase. The 43 kDa polypeptide was observed as a doublet. 

It was noted that the polypeptide of approximately 43 kDa in lane 3 migrated 
differently than the doublet in lane 4; it may be an unrelated protein. The 120 kDa 
and 43 kDa doublet each stained with Coomassie brilliant blue at approximately the 
level of 1 pmol, when compared with BSA standards. Because this fraction contained 

10 1 pmol of telomerase RNA, all of which was assembled into an RNP particle (See, 

Figure 2, lane 8), there appear to be two polypeptide subunits that are stoichiometric 
with the telomerase RNA. However, it is also possible that the two proteins around 43 
kDa are separate enzyme subunit.s 

Affinity-purified telomerase that was not subjected to fractionation on a 

15 glycerol gradient contained additional polypeptides with apparent molecular masses of 

35 and 37 kDa, respectively. This latter fraction was estimated to be at least 50% 
pure. However, the 35 kDa and 37 kDa polypeptides that were present in the affinity- 
purified material were not reproducibly separated by glycerol gradient centrifugation. 
These polypeptides may be contaminants, as they were not visible in all activity- 

20 containing preparations. 



EXAMPLE 8 
Sedimentation Coefficient 

The sedimentation coefficient for telomerase was determined by glycerol 
gradient centrifugation. In this Example, nuclear extract and affinity-purified 
25 telomerase were fractionated on 15-40% glycerol gradients containing 20 mM Tris- 

acetate, with 1 mM MgClj, 0.1 mM EDTA, 300 mM KGlu, and 1 mM DTT, at pH 
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7.5. Glycerol gradients were poured in 5 ml (13 x 51 mm) tubes, and centrifuged 
using an SW55Ti rotor (Beckman) at 55,000 rpm for 14 hours at 4°C. 

Marker proteins were run in a parallel gradient and had a sedimentation 
coefficient of 7.6 S for alcohol dehydrogenase (ADH), 113 S for catalase, 17.3 S for 
5 apoferritin, and 19.3 S for thyroglobulin. The telomerase peak was identified by 

native gel electrophoresis of gradient fractions followed by blot hybridization to its 
RNA component. 

Figure 5 is a graph showing the sedimentation coefficient for telomerase. As 
shown in this Figure, affinity-purified telomerase co-sedimented with catalase at 11.5 

10 S, while telomerase in nuclear extracts sedimented slightly faster, peaking around 12.5 
S. Therefore, consistent with the mobility of the enzyme in native gels, purified 
telomerase appears to have lost a proteolytic fi-agment or a loosely associated subunit. 

The calculated molecular mass for telomerase, if it is assumed to consist of 
one 120 kDa protein subunit, one 43 kDa subunit, and one RNA subunit of 66 kDa, 

15 adds up to a total of 229 kDa. This is in close agreement with the 232 kDa molecular 

mass of catalase. However, the sedimentation coefficient is a function of the 
molecular mass, as well as the partial specific volume and the frictional coefficient of 
the molecule, both of which are unknown for the telomerase RNP. 



EXAMPLE 9 

20 Substrate Utilization 

In this Example, the substrate requirements of telomerase were investigated. 
One simple model for DNA end replication predicts that after semi-conservative DNA 
replication, telomerase extends double-stranded, blunt-ended DNA molecules. In a 
variation of this model, a single- stranded 3' end is created by a helicase or nuclease 
25 after replication. This 3' end is then used by telomerase for binding and extension. 

To determine whether telomerase is capable of elongating blunt-ended 
molecules, model hairpins were synthesized with telomeric repeats positioned at their 
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3' ends. These primer substrates were gel-purified, 5 '-end labelled with 
polynucleotide kinase, heated at 0.4 \iM to 80°C for 5 minutes, and then slowly cooled 
to room temperature in a heating block, to allow renaturation and heUx formation of 
the hairpins. Substrate mobility on a non-denaturing gel indicated that very efficient 
5 hairpin formation was present, as compared to dimerization. 

In this Example, assays were performed with unlabelled 125 p,M dGTP, 125 
|LiM dTTP, and 0.02 [iM 5 '-end-labelled primer (5'-^2p-labelled oligonucleotide 
substrate) in 10 yd reaction mixtures that contained 20 mM Tris-acetate, with 10 mM 
MgClj, 50 mM KGlu, and 1 mM DTT, at pH 7.5. These mixtures were incubated at 

10 25°C for 30 minutes. Reactions were stopped by adduig formamide loading buffer 
(i.e., TBE, formamide, bromthymol blue, and cyanol, Sambrook, 1989, supra). 

Primers were incubated without telomerase ("-"), with 5.9 fmol of affinity- 
purified telomerase ("+"), or with 17.6 fmol of affinity-purified telomerase ("+++"). 
Affinity-purified telomerase used in this assay was dialyzed with a membrane having a 

15 molecular cut-off of 100 kDa, in order to remove the displacement oligonucleotide. 

Reaction products were separated on an 8% PAGE/urea gel containing 36% 
formamide, to denature the hairpins. The sequences of the primers used in this study, 
as well as their lane assignments are shown in Table 2. 
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TABLE 2. Primer Sequences 





Primer Sequence (5' to 3') 




1-3 


C4(A4C4)3CACA(G4T4)3G4 


SEQ ID NO:33 


4-6 


C2(A4C4)3CACA(G4T4) G 


SEQ ID NO:34 


7-9 


(A4C4)3CACA(G4T4)3G4 


SEQ ID NO:35 


10-12 


A2C4(A4C4)2CACA(G4T4)3G4 


SEQ ID NO;36 


13-15 


C4(A4C4)2CACA(G4T4)3 


SEQ ID NO:37 


16-18 


(A4C4)3CACA(G4T4)3 


SEQ ID NO:38 


19-21 


A2C4(A4C4)2CACA(G4T4)3 


SEQ ID NO:39 


22-24 


C4(A4C4)2CACA(G4T4)3 


SEQ ID NO:40 


25-27 


C2(A4C4)2CACA(G4T4)3 


SEQ ID N0:41 


28-30 


(A4C4)2CACA(G4T4)3 


SEQ ID NO:42 



The gel results are shown in Figure 6. Lanes 1-15 contained substrates with 
telomeric repeats ending with four G residues. Lanes 16-30 contained substrates with 

15 telomeric repeats ending with four T residues. The putative alignment on the 

telomerase RNA template is indicated in Figure 7 (SEQ ID NOS:43 and 44, and 45 
and 46, respectively). It was assumed that the primer sets anneal at two very different 
positions in the template shown in Figure 7 (i.e., 7 A and 7B, respectively). This may 
have affected their binding and/or elongation rate. 

20 Figure 8 shows a lighter exposure of lanes 25-30 in Figure 6. The lighter 

exposure of Figure 8 was taken in order to permit visualization of the nucleotides that 
are added and the positions of pausing in elongated products. Percent of substrate 
elongated for the third lane in each set was quantified on a Phosphorlmager, as 
indicated on the bottom of Figure 6. 

25 The substrate efficiencies for these hairpins were compared with double- 

stranded telomere-like substrates with overhangs of differing lengths. A model 
substrate that ended with four G residues (see lanes 1-15 of Figure 6), was not 
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elongated when it was blunt ended (see lanes 1-3). However, slight extension was 
observed with an overhang length of two bases; elongation became efficient when the 
overhang was at least 4 bases in length. The telomerase acted in a similar manner 
with a double-stranded substrate that ended with four T residues, with a 6-base 
5 overhang required for highly efficient elongation. In Figure 6, the faint bands below 

the primers in lanes 10-15 that are independent of telomerase represent shorter 
oligonucleotides in the primer preparations. 

The lighter exposure of lanes 25-30 in Figure 8 shows a ladder of elongated 
products, with the darkest bands correlating with the putative 5' boundary of the 

10 template (as described by Lingner et al, Genes Develop., 8:1984 [1994]). The 

abundance of products that correspond to other positions in the template suggested that 
pausing and/or dissociation occurs at sites other than the site of translocation with the 
purified telomerase. 

As shown in Figure 6, double-stranded, blunt-ended oligonucleotides were not 

15 substrates for telomerase. To determine whether these molecules would bind to 

telomerase, a competition experiment was performed. In this experiment, 2 nM of 5'- 
end labelled substrate with the sequence (G4T4)2 (SEQ ID N0:61), or a hairpin 
substrate with a six base overhang respectively were extended with 0.125 nM 
telomerase (Figiire 6, lanes 25-27). Although the same unlabeled oligonucleotide 

20 substrates competed efficiently with labelled substrate for extension, no reduction of 

activity was observed when the double-stranded blunt-ended hairpin oligonucleotides 
were used as competitors, even in the presence of 100-fold excess hairpins. 

These results indicated that double-stranded, blunt-ended oligonucleotides 
cannot bind to telomerase at the concentrations tested in this Example. Rather, a 

25 single- stranded 3' end is required for binding. It is likely that this 3' end is required 

to base pair with the telomerase RNA template. 
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EXAMPLE 10 
Cloning & Sequencing of the 123 kDa Polypeptide 

In this Example, the cloning of the 123 kDa polypeptide of telomerase (i.e., the 
123 kDa protein subunit) is described. In this study, an internal fragment of the 
5 telomerase gene was amplified by PGR, with oligonucleotide primers designed to 

match peptide sequences that were obtained from the purified polypeptide obtained in 
Example 3, above. The polypeptide sequence was determined using the nanoES 
tandem mass spectroscopy methods known in the art and described by Calvio et al, 
RNA 1:724-733 [1995]). The oligonucleotide primers used in this Example had the 
10 following sequences, with positions that were degenerate shown in parentheses~5'- 

TCT(G/A)AA(G/A)TA(G/A)TG(T/G/A)GT(G/A/T/C)A(T/G/A)(G/A)TT(G/A)TTCAT- 
3' (SEQ ID NO:47), AND 5'- 

GCGGATCCATGAA(T/C)CC(A/T)GA(G/A)AA(T/C)CC(A/T)AA(T/C)GT-3 ' (SEQ 
ID NO:48). 

15 A 50 reaction contained 0.2 mM dNTPs, 0.15 |j,g E. aediculatus 

chromosomal DNA, 0.5 \x\ Tag (Boehringer-Mannheim), 0.8 |ag of each primer, and 
Ix reaction buffer (Boehringer-Mannheim). The reaction was incubated in a 
thermocycler (Perkin-Elmer), using the following~5 minutes at 95°C, followed by 30 
cycles of 1 minute at 94°C, 1 minute at 52°C, and 2 minutes at 72°C. The reaction 

20 was completed by 10 minute incubation at 72°C. 

A genomic DNA library was prepared from the chromosomal E. aediculatus 
DNA by cloning blunt-ended DNA into the Smal site of pCR-Script plasmid vector 
(Stratagene). This library was screened by colony hybridization, with the 
radiolabelled, gel-purified PGR product. Plasmid DNA of positive clones was 

25 prepared and sequenced by the dideoxy method (Sanger et al., Proc. Natl. Acad. Sci., 

74:5463 [1977]) or manually, through use of an automated sequencer (ABI). The 
DNA sequence of the gene encoding this polypeptide is shown in Figure 9 (SEQ ID 
N0:1). The start codon in this sequence inferred from the DNA sequence, is located 
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at nucleotide position 101, and the open reading frame ends at position 3193. The 
genetic code of Euplotes differs from other organisms in that the "UGA" codon 
encodes a cysteine residue. The amino acid sequence of the polypeptide inferred from 
the DNA sequence is shown in Figure 10 (SEQ ID N0:2), and assumes that no 
5 unusual amino acids are inserted during translation and no post-translational 

modification occurs. 



EXAMPLE 11 
Cloning & Sequencing of the 43 kDa Polypeptide 

In this Example, the cloning of the 43 kDa polypeptide of telomerase {i.e., the 
10 43 kDa protein subunit) is described. In this study, an internal fragment of the 
telomerase gene was amplified by PGR, with oligonucleotide primers designed to 
match peptide sequences that were obtained from the purified polypeptide obtained in 
Example 3, above. The polypeptide sequence was determined using the nanoES 
tandem mass spectroscopy methods known in the art and described by Galvio et al, 
15 RNA 1:724-733 [1995]). The oligonucleotide primers used in this Example had the 

following sequences--5'-NNNGTNAC(C/T/A)GG(C/T/A)AT(G/T/A)AA(G/T)AA-3' 
(SEQ ID NO:49), and 5'- 

(T/G/A)GG(T/G/A)GT(G/T)TG(T/G)TG(G/A)TG(G/A)TT(G/A)TA-3 ' (SEQ ID 
NO:50). In this sequence, "N" indicates the presence of any of the four nucleotides 

20 {i.e., A, T, G, or C). 

A 50 \x\ reaction contained 0.2 mM dNTPs, 0.2 \xg E. aediculatus chromosomal 
DNA, 0.5 |j,l Taq (Boehringer-Mannheim), 0.8 \ig of each primer, and Ix reaction 
buffer (Boehringer-Mannheim). The reaction was incubated in a thermocycler (Perkin- 
Elmer), using the following~5 minutes at 95°C, followed by 30 cycles of 1 minute at 

25 94°C, 1 minute at 52°C, and 1 minutes at 72°C. The reaction was completed by 10 

minute incubation at 72°C. 
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A genomic DNA library was prepared from the chromosomal E. aediculatus 
DNA by cloning blunt-ended DNA into the Smal site of pCR-Script plasmid vector 
(Stratagene). This library was screened by colony hybridization, with the 
radiolabelled, gel-purified PGR product. Plasmid DNA of positive clones was 
5 prepared and sequenced by the dideoxy method (Sanger et ai, Proc. Natl. Acad. Sci., 

74:5463 [1977]) or manually, through use of an automated sequencer (ABI). The 
DNA sequence of the gene encoding this polypeptide is shown in Figure 1 1 (SEQ ID 
N0:3). Three potential reading frames are shown for this sequence, as shown in 
Figure 12. For clarity, the amino acid sequence is indicated below the nucleotide 

10 sequence in all three reading frames. These reading frames are designated as "a," "b," 

and "c" (SEQ ID NOS:4-6). A possible start codon is encoded at nucleotide position 
84 in reading frame "c." They coding region could end at position 1501 in reading 
frame "b." Early stop codons, indicated by asterisks in this figure, occur in all three 
reading frames between nucleotide position 337-350. 

15 The "La-domain" is indicated in bold-face type. Further downstream, the 

protein sequence appears to be encoded by different reading frames, as none of the 
three frames is uninterrupted by stop codons. Furthermore, peptide sequences from 
purified protein are encoded in all three frames. Therefore, this gene appears to 
contain intervening sequences, or in the alternative, the RNA is edited. Other 

20 possibilities include ribosomal frame-shifting or sequence errors. However, the 
homology to the La-protein sequence remains of significant interest. Again, in 
Euplotes, the "UGA" codon encodes a cysteine residue. 

EXAMPLE 12 
Amino Acid and Nucleic Acid Comparisons 

25 In this Example, comparisons between various reported sequences and the 

sequences of the 123 kDa and 43 kDa telomerase subunit polj^eptides were made. 
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Comparisons with the 123 kDa E. aediculatus Telomerase Subunit 

The amino acid sequence of the 123 kDa Euplotes aediculatus polypeptide was 
compared with the sequence of the 80 kDa telomerase protein subunit of Tetrahymena 
thermophila (GenBank accession #U25641) in order to investigate their similarity. 
5 The nucleotide sequence as obtained from GenBank (SEQ ID N0:51) encoding this 

protein is shown in Figure 19. The amino acid sequence of this protein as obtained 
from GenBank (SEQ ID NO:52) is shown in Figure 20. The sequence comparison 
between the 123 kDa E. aediculatus and 80 kDa T. thermophila is shown in Figure 13. 
In this figure, the E. aediculatus sequence is the upper sequence (SEQ ID N0:2), 

10 while the T. thermophila sequence is the lower sequence (SEQ ID NO:52). In this 

Figure, as well as Figures 14-16, identities are indicated by vertical bars, while single 
dots between the sequences indicate somewhat similar amino acids, and double dots 
between the sequences indicate more similar amino acids. The observed identity was 
determined to be approximately 19%, while the percent similarity was approximately 

15 45%, values similar to what would be observed with any random protein sequence. 

The amino acid sequence of the 123 kDa Euplotes aediculatus polypeptide was 
also compared with the sequence of the 95 kDa telomerase protein subunit of 
Tetrahymena thermophila (GenBank accession #U25642), in order to investigate their 
similarity. The nucleotide sequence as obtained from GenBank (SEQ ID NO:53) 

20 encoding this protein is shown in Figure 21. The amino acid sequence of this protein 

as obtained from GenBank (SEQ ID NO:54) is shown in Figure 22. This sequence 
comparison is shown in Figure 14. In this figure, the E. aediculatus sequence is the 
upper sequence (SEQ ID N0:2), while the T. thermophila sequence is the lower 
sequence (SEQ ID NO:54); identities are indicated by vertical bars. The observed 

25 identity was determined to be approximately 20%), while the percent similarity was 

approximately 43%, values similar to what would be observed with any random 
protein sequence. 

Significantly, the amino acid sequence of the 123 kDa E. aediculatus 
polypeptide contains the five motifs (SEQ ID N0S:13 and 18) characteristic of reverse 
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transcriptases. The 123 kDa polypeptide was also compared with the polymerase 
domains of various reverse transcriptases (SEQ ID NOS:14-17, and 19-22). Figure 17 
shows the alignment of the 123 kDa polypeptide with the putative yeast homolog 
(L8543.12 or ESTp)(SEQ ID NOS: 17 and 22). The amino acid sequence of 
5 L8543.12 (or ESTp) obtained from GenBank is shown in Figure 23 (SEQ ID NO:55). 

Four motifs (A, B, C, and D) were included in this comparison. In this Figure 
17, highly conserved residues are indicated by white letters on a black background. 
Residues of the E. aediculatus sequences that are conserved in the other sequence are 
indicated in bold; the "h" indicates the presence of a hydrophobic amino acid. The 

10 numerals located between amino acid residues of the motifs indicates the length of 

gaps in the sequences. For example, the "100" shown between motifs A and B reflects 
a 1 00 amino acid gap in the sequence between the motifs. 

Genbank searches identified a yeast protein with the accession number u20618, 
and gene "L8543.12" (Est2), whose amino acid sequence shows some homology to the 

15 E. aediculatus 123 kDa telomerase subunit. Based on the observations that both 

proteins contain reverse transcriptase motifs in their C-terminal regions; both proteins 
share similarity in regions outside the reverse transcriptase motif; the proteins are 
similarly basic (pi = 10.1 for E. aediculatus and pl=10.0 for the yeast); and both 
proteins are large (123 kDa for E. aediculatus and 103 kDa for the yeast), these 

20 sequences comprise the catalytic core of their respective telomerases. It is 

contemplated that based on this observation of homology in two phylogenetically 
distinct organisms as E. aediculatus and yeast, the human telomerase will contain a 
protein that has the same characteristics (i.e., reverse transcriptase motifs, is basic, and 
large [> 100 kDa]). 



25 Comparisons with the 43 kDa E. aediculatus Telomerase Subunit 

The amino acid sequence of the "La-domain" of the 43 kDa Euplotes 
aediculatus polypeptide was compared with the sequence of the 95 kDa telomerase 
protein subunit of Tetrahymena thermophila (described above) in order to investigate 
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their similarity. This sequence comparison is shown in Figure 15. In this figure, the 
E. aediculatus sequence is the upper sequence (SEQ ID N0:9), while the T. 
thermophila sequence is the lower sequence (SEQ ID NO: 10); identities are indicated 
by vertical bars. The observed identity was determined to be approximately 23%, 
5 while the percent similarity was approximately 46%, values similar to what would be 

observed with any random protein sequence. 

The amino acid sequence of the "La-domain" of the 43 kDa Euplotes 
aediculatus polypeptide was compared with the sequence of the 80 kDa telomerase 
protein subimit of Tetrahymena thermophila (described above) in order to investigate 

10 their similarity. This sequence comparison is shown in Figure 16. In this figure, the 
E. aediculatus sequence is the upper sequence (SEQ ID NO: 11), while the T. 
thermophila sequence is the lower sequence (SEQ ID NO: 12); identities are indicated 
by vertical bars. The observed identity was determined to be approximately 26%, 
while the percent similarity was approximately 49%, values similar to what would be 

15 observed with any random protein sequence. 

The amino acid sequence of a domain of the 43 kDa E. aediculatus polypeptide 
(SEQ ID NO:23) was also compared with La proteins from various other organisms 
(SEQ ID NOS:24-27). These comparisons are shown in Figure 18. In this Figure, 
highly conserved residues are indicated by white letters on a black background. 

20 Residues of the E. aediculatus sequences that are conserved in the other sequence are 

indicated in bold. 

EXAMPLE 13 
Identification of Telomerase Protein Subunits 
in Anothier Organism 

25 In this Example, the sequences identified in the previous Examples above, were 

used to identify the telomerase protein subunits of Oxytricha trifallax, a ciliate that is 
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very distantly related to E. aediculatus. In this Example, primers were chosen based 
on the conserved region of the E. aediculatus 123 kDa polypeptide which comprised 
the reverse transcriptase domain motifs. Suitable primers were synthesized and used in 
a PGR reaction with total DNA from Oxytricha. The Oxytricha DNA was prepared 
5 according to methods known in the art. The PGR products were then cloned and 

sequenced using methods known in the art. 

The oligonucleotide sequences used as the primers were as follows: 5'- 
(T/G)A(A/G)AG(T/A/G)AA(G/A)GG(T/A/G)AT(T/G)GG(G/T/A)(G/T)A(G/A)GG-3' 
(SEQ ID NO:56) and 

10 5 ' -(G/A/T)GT(G/A/T) ATNA(G/A)NA(G/A)(G/A)TA(G/A)TC(G/A)TC-3 ' (SEQ ID 

NO: 5 7). Positions that were degenerate are shown in parenthesis, with the alternative 
bases shown within the parenthesis. "N" represents any of the four nucleotides. 

In the PGR reaction, a 50 \i\ reaction contained 0.2 mM dNTPs, 0.3 p-g 
Oxytricha trifallax chromosomal DNA, 1 i^l Taq polymerase (Boehringer-Mannheim), 

15 2 micromolar of each primer, Ix reaction buffer (Boehringer-Mannheim). The reaction 

was incubated in a thermocycler (Perkin-Elmer) under the following conditions: Ix 5 
min at 95°C, 30 cycles consisting of 1 min at 94°C, 1 min at 53°C, and 1 min at 
72°C, followed by Ix 10 min at 72°C. The PCR-product was gel-purified and 
sequenced by the dideoxy-method, by methods known well in the art {e.g., Sanger et 

20 al, Proc. Natl. Acad. Sci. 74, 5463-5467 (1977). 

The deduced amino acid sequence of the PGR product was determined and 
compared with the E. aediculatus sequence. Figure 24 shows the alignment of these 
sequences, with the O. trifallax sequence (SEQ ID NO: 5 8) shown in the top row, and 
the E. aediculatus sequence (SEQ ID NO: 5 9) shown in the bottom row. As can be 

25 seen from this Figure, there is a great deal of homology between the O. trifallax 

polypeptide sequence identified in this Example with the E. aediculatus polypeptide 
sequence. Thus, it is clear that the sequences identified in the present invention are 
useful for the identification of homologous telomerase protein subunits in other 
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eiikaryotic organisms. Indeed, development of the present invention has identified 
homologous telomerase sequences in multiple, diverse species. 



EXAMPLE 15 
Identification of Tetrahymena Telomerase Sequences 

5 In this Example, a Tetrahymena clone was produced that shares homology with 

the Euplotes sequences, and EST2p. 

This experiment utilized PGR with degenerate oligonucleotide primers directed 
against conserved motifs to identify regions of homology between Tetrahymena, 
Euplotes, and EST2p sequences. The PGR method used in this Example is a novel 

10 method that is designed to specifically amplify rare DNA sequences from complex 

mixtures. This method avoids the problem of amplification of DNA products with the 
same PGR primer at both ends (i.e., single primer products) commonly encountered in 
PGR cloning methods. These single primer products produce unwanted background 
and can often obscure the amplification and detection of the desired two-primer 

1 5 product. The method used in these experiment preferentially selects for two-primer 

products. In particular, one primer is biotinylated and the other is not. After several 
roimds of PGR amplification, the products are purified using streptavidin magnetic 
beads and two primer products are specifically eluted using heat denaturation. This 
method finds use in settings other than the experiments described in this Example. 

20 Indeed, this method finds use in application in which it is desired to specifically 

amplify rare DNA sequences, including the preliminary steps in cloning methods such 
as 5' and 3; RAGE, and any method that uses degenerate primers in PGR. 

A first PGR run was conducted using Tetrahymena template macronuclear 
DNA isolated using methods known in the art, and the 24-mer forward primer with the 

25 sequence 5' biotin-GGGTATTT(TG)TT 

(TG)TA(TG)(GATG)(GATG)(GATG)AG(GATG)GA-3' (SEQ ID NO:70) designated as 
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"K231," corresponding to the FFYXTE region (SEQ ID N0:71), and the 23-mer 
reverse primer with the sequence 5'- 

CCAGATAT(GATC)A(TGA)(GATC)A(AG)(AG)AA(AG)TC(AG)TC- 3' (SEQ ID 
NO:72), designated as "K220," corresponding to the DDFL(FIL)I region (SEQ ID 
5 NO:73). This PGR reaction contained 2.5 yd DNA (50 ng), 4 |al of each primer (20 

laM), 3 ^illOx PGR buffer, 3 ^xl lOx dNTPs, 2 ^il Mg, 0.3 |nl Tag, and 11.2 ixl dHp. 
The mixture was cycled for 8 cycles of 94°C for 45 seconds, 37°C for 45 seconds, and 
72 °C for 1 minute. 

This PGR reaction was bound to 200 |al streptavidin magnetic beads, washed 

10 with 200 1^1 TE, resuspended in 20 p,l dHjO and then heat-denatured by boiling at 

100°C for 2 minutes. The beads were pulled down and the eluate removed. Then, 2.5 
|il of this eluate was subsequently reamplified using the above conditions, with the 
exception bieng that 0.3 p,l of a -^^P dATP was included, and the PGR was carried out 
for 33 cycles. This reaction was nm a 5% denaturing polyacrylamide gel, and the 

15 appropriate region was cut out of the gel. These products were then reamplified for an 

additional 34 cycles, under the conditions listed above, with the exception being that a 
42°C annealing temperature was used. 

A second PGR run was conducted using Tetrahymena macronuclear DNA 
template isolated using methods known in the art, and the 23-mer forward primer with 

20 the sequence 5' 

AGAATG(CA)G(GATC)(TCA)T(GATC)(TCA)T(GATG)CC(GATG)AA(AG)AA-3' 
(SEQ ID NO:74), designated as "K228," corresponding to the region R(LI)(LI)PKK 
(SEQ ID NO:75), and a reverse primer with the sequence 5'- 
AGGAATC(GT)(GATG)GG(TAG)AT(GATC)(GC)(TA)(AG)TG(AG)TA(AG)GA 3' 

25 (SEQ ID NO:76), designated "K224," corresponding to the CYDSIPR region (SEQ ID 

NO:77). This PGR reaction contained 2.5 |al DNA (50 ng), 4 |al of each primer (20 
\iM), 3 nl lOx PGR buffer, 3 |al lOx dNTPs, 2 \i\ Mg, 0.3 \A a -^^P dATP, 0.3 |al 
Taq, and 10.9 \x\ dHjO. This reaction was run on a 5% denaturing polyacrylamide 
gel, and the appropriate region was cut out of the gel. These products were 
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reamplified for an additional 34 cycles, under the conditions listed above, with the 
exception being that a 42°C annealing temperature was used. 

Ten |j,l of the reaction product from run 1 were bound to streptavidin-coated 
magnetic beads in 200 |li1 TE. The beads were washed with 200 )li1 TE, and then then 
5 resuspended in 20 jil of dH20, heat denatured, and the eluate was removed. Next, 2.5 

p-l of this eluate was reamplified for 33 cycles using the conditions indicated above. 
The reaction product from run 2 was then added to the beads and diluted with 30 |xl 
0.5x SSC. The mixture was heated from 94°C to 50°C. The eluate was removed and 
the beads were washed three times in 0.5x SSC at 55°C. The beads were then 
10 resuspended in 20 |ul dHjO, heat denatured, and the eluate was removed, designated as 
"round 1 eluate" and saved. 

To isolate the Tetrahymena band, the round 1 eluate was reamplified with the 
forward primer K228 (SEQ ID NO:74) and reverse primer K227 (SEQ ID NO:78) 
with the sequence 5'- 

15 CAATTCTC(AG)TA(AG)CA(GATC)(CG)(TA)(CT)TT(AGT)AT(GA)TC-3' (SEQ ID 

NO:78), corresponding to the DIKSCYD region (SEQ ID NO:79). The PGR reactions 
were conducted as described above. The reaction products were run on a 5% 
polyacrylamide gel; the band corresponding to approximately 295 nucleotides was cut 
from the gel and sequenced. 

20 The clone designated as 168-3 was sequenced. The DNA sequence (including 

the primer sequences) was found to be: 

GATTACTCCCGAAGAAAGGATCTTTCCGTCCAATCATGACTTTCTTAAGAAAG 
GACAAGCAAAAAAATATTAAGTTAAATCTAAATTAAATTCTAATGGATAGCC 
AACTTGTGTTTAGGAATTTAAAAGACATGCTGGGATAAAAGATAGGATACTC 

25 AGTCTTTGATAATAAACAAATTTCAGAAAAATTTGCCTAATTCATAGAGAAA 
TGGAAAAATAAAGGAAGACCTCAGCTATATTATGTCACTCTAGACATAAAGA 
CTTGCTAC (SEQ ID NO:80). 

Additional sequence of this gene was obtained by PGR using one unique primer 
designed to match the sequence from 168-3 ("K297" with the sequence 5'- 

30 GAGTGACATAATATACGTGA-3'; SEQ ID N0:1 1 1), and the K231 (FFYXTE) 



primer. The sequence of the fragment obtained from this reaction, together with 168-3 
is as follows (without the primer sequences): 

AAACACAAGGAAGGAAGTCAAATATTCTATTACCGTAAACCAATATGGAAAT 
TAGTGAGTAAATTAACTATTGTCAAAGTAAGAATTTAGTTTTCTGAAAAGAA 
5 TAAATAAATGAAAAATAATTTTTATCAAAAAATTTAGCTTGAAGAGGAGAAT 
TTGGAAAAAGTTGAAGAAAAATTGATACCAGAAGATTCATTTTAGAAATACC 
CTCAAGGAAAGCTAAGGATTATACCTAAAAAAGGATCTTTCCGTCCAATCAT 
GACTTTCTTAAGAAAGGACAAGCAAAAAAATATTAAGTTAAATCTAAATTAA 
ATTCTAATGGATAGCCAACTTGTGTTTAGGAATTTAAAAGACATGCTGGGAT 

1 0 AAAAGATAGGATACTC AGTCTTTGAT AATAAACAAATTTCAGAAAAATTTGC 

CTAATTCATAGAGAAATGGAAAAATAAAGGAAGACCTCAGCTATATTATGTC 
ACTCTA (SEQ ID N0:81). 

The amino acid sequence corresponding to this DNA fragment was found to be: 
KHKEGSQIFYYRKPIWKLVSKLTIVKVRIQFSEKNKQMKNNFYQKIQLEEENLEK 

1 5 VEEKLIPEDSFQKYPQGKLRIIPKKGSFRPIMTFLRKDKQKNIKLNLNQILMDSQL 
VFRNLKDMLGQKIGYSVFDNKQISEKFAQFIEKWKNKGRPQLYYVTL (SEQ ID 
NO:82). 

This amino acid sequence was then aligned with other telomerase genes 
(EST2p, and Euplotes). The aligrmient is shown in Figure 31. Consensus sequence is 
20 also shown in this Figure. 



EXAMPLE 16 

Identification of Schizosaccharomyces pombe Telomerase Sequences 

In this Example, the tezl sequence of S. pombe was identified as a homolog of 
the E. aediculatus pi 23, and S. cerevisiae Est2p. 
25 Figure 33 provides an overall summary of these experiments. In this Figure, 

the top portion (Panel A) shows the relationship of two overlapping genomic clones, 
and the 5825 bp portion that was sequenced. The region designated at "tezr" is the 
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protein coding region, with the flanking sequences indicated as well, the box 
underneath the 5825 bp region is an approximately 2 kb Hindlll fragment that was 
used to make tezl disruption construct, as described below. 

The bottom half of Figure 33 (Panel B) is a "close-up" schematic of this same 
5 region of DNA. The sequence designated as "original PGR" is the original degenerate 

PGR fragment that was generated with degenerate oligonucleotide primer pair designed 
based on Euplotes sequence motif 4 (B') and motif 5 (C), as described in previous 
Examples. 

PGR With Degenerate Primers 

10 PGR using degenerate primers was used to find the homolog of the E. 

aediculatus pi 23 in S. pombe. Figure 34 shows the sequences of the degenerate 
primers (designated as "poly 4" and "poly 1") used in this reaction. The PGR runs 
were conducted using the same buffer as described in previous Examples {See e.g., 
Example 10, above), with a 5 minute ramp time at 94°G, followed by 30 cycles of 

15 94°G for 30 seconds, 50°G for 45 seconds, and 72°G for 30 seconds, and 7 minutes at 

72°G, followed by storage at 4°G. PGR runs were conducted using varied conditions, 
{i.e., various concentrations of S. pombe DNA and MgGlj concentrations). The PGR 
products were run on agarose gels and stained with ethidium bromide as described 
above. Several PGR runs resulted in the production of three bands (designated as "T," 

20 "M," and "B"). These bands were re-amplified and run on gels using the same 

conditions as described above. Four bands were observed following this re- 
amplification ("T," "Ml," "M2," and "B"), as shown in Figure 35. These four bands 
were then re-amplified using the same conditions as described above. The third band 
from the top of the lane in Figure 35 was identified as containing the correct sequence 

25 for telomerase protein. The PGR product designated as M2 was found to show a 

reasonable match with other telomerase proteins, as indicated in Figure 36. In addition 
to the alignment shown, this Figwe also shows the actual sequence of tezl. In this 
Figure, the asterisks indicate residues shared with all four sequences {Oxytricha "Ot"; 
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E. aediculatus "Eajpl23"; 5'. cerevisiae "Sc_pl03"; and M2), while the circles (i.e., 
dots) indicate similar amino acid residues. 

3' RT PGR 

In order to obtain additional sequence information, 3' and 5' RT PGR were 
5 conducted on the telomerase candidate identified in Figure 36. Figure 37 provides a 

schematic of the 3' RT PGR strategy used. First, cDNA was prepared from mRNA 
using the oligonucleotide primer "Qj," (5'-GGA GTG AGG AGA GTG AGG AGG 
AGT GGA GGT GAA GGT TTT TTT TTT TTT TT-3'; SEQ ID NO.T02), then using 
this cDNA as a template for PGR with "Qo" (5'-GGA GTG AGG AGA GTG ACG- 

10 3'; SEQ ID NO:103), and a primer designed based on the original degenerated PGR 
reaction (i.e., "M2-T" with the sequence 5'-G TGT GAT TTC TAT ATG GAA GAT 
TTG ATT GAT G-3' (SEQ ID NO: 109). The second PGR reaction (i.e., nested PGR) 
with "Qi" (5'-GAG GAG TGG AGG TCA AGC-3'; SEQ ID NO:104), and another 
PGR primer designed with sequence derived from the original degenerate PGR reaction 

15 or "M2-T2" with the sequence 5'-AC GTA TGG TTT AGG AAA AAG AAA GGA 

TCA GTG-3'; SEQ ID NO: 110). The buffers used in this PGR were the same as 
described above, with amplification conducted beginning with a ramp up of 94° for 5 
min, followed by 30 cycles of 94° for 30 sec, 55°C for 30 sec, and 72°G for 3 min), 
followed by 7 minutes at 72°G. The reaction products were stored at 4°G until use. 

20 Screening of Genomic and cDNA Libraries 

After obtaining this extra sequence information, several genomic and cDNA 
libraries were screened to identify any libraries that contain this telomerase candidate 
gene. The approach used, as well as the libraries and resuhs are shovm in Figure 38. 
In this Figure, Panel A lists the libraries tested in this experiment; Panel B shows the 
25 regions used; Panels C and D show the dot blot hybridization results obtained with 

these libraries. Positive libraries were then screened by colony hybridization to obtain 
genomic and cDNA version of tezl gene. In this experiment, approximately 3x10"* 
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colonies from the Hindlll genomic library were screened and six positive clones were 
identified (approximately 0.01%). DNA was then prepared from two independent 
clones (A5 and B2). Figure 39 shows the results obtained with the ifmdIII-digested 
A5 and B2 positive genomic clones. 
5 In addition, cDNA REP libraries were used. Approximately 3x10^ colonies 

were screened, and 5 positive clones were identified (0.002%). DNA was prepared 
from three independent clones (2-3, 4-1, and 5-20). In later experiments, it was 
determined that 2-3 and 5-20 contained identical inserts. 



5' RT PCR 

10 As the cDNA version of gene produced to this point was not complete, 5' 

RT-PCR was conducted in order to obtain a full length clone. The strategy is 
schematically shovm in Figure 40. In this experiment, cDNA was prepared using 
DNA oligonucleotide primer "M2-B" (5'-CAC TGA TCC TTT CTT TTT CGT AAA 
CGA TAG GT-3'; SEQ ID NO: 105) and "M2-B2" (5'-C ATC AAT CAA ATC TTC 

15 CAT ATA GAA ATG ACA-3'; SEQ ID NO:106), designed from known regions of 

tezl identified previously. An oligonucleotide linker PCR Adapt Sfil with a 
phosphorylated 5' end ("P") (P-GGG CCG TGT TGG CCT AGT TCT CTG CTC-3'; 
SEQ ID NO: 107) was then ligated at the 3' end of this cDNA, and this construct was 
used as the template for nested PCR. In the first round of PCR, PCR Adapt SFI and 

20 M2-B were used as the primers; while PCR Adapt Sfill (5 -GAG GAG GAG AAG 

AGC AGA GAA CTA GGC CAA CAC GCC CC-3'; SEQ ID NO:108), and M2-B2 
(5'- ATC AAT CAA ATC TTC CAT ATA GAA ATG ACA-3'; SEQ ID NO:106) 
were used as primers in the second round. Nested PCR was used to increase 
specificity of reaction. 



25 Sequence Alignments 

Once the sequence of tezl was identified, it was compared with sequences 
previously described. Figure 41 shows the alignment of reverse transcriptase (RT) 
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domains from telomerase catalytic subunits of S. pombe ("S.p. Tezlp"), 5". cerevisiae 
("S.c. Est2p"), and E. aediculatus pl23 ("E.a. pl23"). In this Figure, "h" indicates 
hydrophobic residues, while "p" indicates small polar residues, and "c" indicates 
charged residues. The amino acid residues indicated above the alignment shows the 
5 consensus RT motif of Y. Xiong and T.H. Eickbush (Y. Xiong and T.H. Eickbush, 

EMBO J., 9: 3353-3362 [1990]). The asterisks indicate the residues that are conserved 
for all three proteins. "Motif O" is identified herein as a motif specific to this 
telomerase subunit and not foimd in reverse transcriptases in general. It is therefore 
valuable in identifying other amino acid sequences as being good candidates for 
10 telomerase catalytic subunits. 

Figure 42 shows the alignment of entire sequences from Euplotes ("Ea_pl23"), 
S. cerevisiae ("Sc_Est2p"), and S. pombe ("Sp_Tezlp"). In Panel A, the shaded areas 
indicate residues shared between two sequences. In Panel B, the shaded areas indicate 
residues shared between all three sequences. 

1 5 Genetic Disruption of tezl 

In this Example, the effects of disruption of tezl were investigated. As 
telomerase is involved in telomere maintenance, it was hypothesized that if tezl were 
indeed a telomerase component, disruption of tezl was expected to cause gradual 
telomere shortening. 

20 In these experiments, homologous recombination was used to specifically 

disrupt the tezl gene in 5*. pombe. This approach is schematically illustrated in Figure 
43. As indicated in Figure 43, wild type tezl was replaced with a fragment containing 
the ura4 or LEU2 marker. 

The disruption of tezl gene was confirmed by PGR (Figure 44) , and Southern 

25 blot was performed to check for telomere length. Figure 45 shows the Southern blot 

results for this experiment. Because an Apa I restriction enzyme site is present 
immediately adjacent to telomeric sequence in S. pombe, digestion of S. pombe 
genomic DNA preparations permits analysis of telomere length. Thus, DNA from S. 
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pombe was digested with Apal and the digestion products were run on an agarose gel 
and probed with a telomeric sequence-specific probe to determine whether the 
telomeres of disrupted S. pombe cells were shortened. The results are shown in Figure 
45. From these results, it was clear that disruption of the tezl gene caused a 
5 shortening of the telomeres. 



EXAMPLE 17 
Identification of Human Telomerase Motifs 



In this Example, the nucleic and amino acid sequence information for human 
telomerase motifs was determined. These sequences were first identified in a BLAST 

10 search conducted using the Euplotes 123 kDa peptide, and a homologous sequence 

from Schizosaccharomyces, designated as "tezl." The human sequence (also referred 
to as "hTCPl.l") was identified from a cDNA clone (GenBank accession 
#AA281296). Sequences from this clone were aligned with the sequences determined 
as described in previous Examples. 

15 Figure 25 shows the sequence alignment of the Euplotes ("pl23"), 

Schizosaccharomyces ("tezl"), Est2p {i.e., the S. cerevisiae protein encoded by the 
Est2 nucleic acid sequence, and also referred to herein as "L8543.12"), and the human 
homolog identified in this comparison search. The amino acid sequence of this 
aligned portion is provided in SEQ ID N0:61 (the cDNA sequence is provided in SEQ 

20 ID NO:62), while the portion of tezl shown in Figure 25 is provided in SEQ ID 

NO:63. The portion of Est2 shown in this Figure is also provided in SEQ ID NO:64, 
while the portion of pl23 shown is also provided in SEQ ID NO:65. 

As shovm in Figure 25, there are regions that are highly conserved among these 
proteins. For example, as shovra in this Figure, there are regions of identity in "Motif 

25 0," "Motif 1, "Motif 2," and "Motif 3." The identical amino acids are indicated with 

an asterisk (*), while the similar amino acid residues are indicated by a circle (•). 
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This indicates that there are regions within the telomerase motifs that are conserved 
among a wide variety of eukaryotes, ranging from yeast to ciUates, to humans. It is 
contemplated that additional organisms will likewise contain such conserved regions of 
sequence. 

5 Figure 27 shows the amino acid sequence of the cDNA clone encoding human 

telomerase motifs (SEQ ID NO:67), while Figure 28 shows the DNA sequence of the 
clone. Figure 29 shows the amino acid sequence of tezl (SEQ ID NO:68), while 
Figure 30 shows the DNA sequence of tezl (SEQ ID NO:69). In Figure 30, the 
introns and other non-coding regions, are shown in lower case, while the exons {i.e., 
10 coding regions are shown in upper case 



- 95 - 




EXAMPLE 18 
Identification of Human Telomerase Sequences 

In this Example, human telomerase sequences were identified. To generate this 
sequence, Sanger dideoxy sequencing methods were used, as known in the art. The 
5 primers used in the sequencing are shown in the following Table. 



Table 3.Primers 



Primer 


Sequence 


SEQ ID NO: 


TCPl.l 


GTGAAGGCACTGTTCAGCG 


SEQ ID NO:87 


TCP 1.2 


GTGGATGATTTCTTGTTGG 


SEQ ID NO:88 


TCP1.3 


ATGCTCCTGCGTTTGGTGG 


SEQ ID NO:89 


TCP 1.4 


CTGGACACTCAGCCCTTGG 


SEQ ID NO: 90 


TCP 1.5 


GGCAGGTGTGCTGGACACT 


SEQ ID N0:91 


TCP 1.6 


TTTGATGATGCTGGCGATG 


SEQ ID NO:92 


TCP1.7 


GGGGCTCGTCTTCTACAGG 


SEQ ID NO:93 


TCP1.8 


CAGCAGGAGGATCTTGTAG 


SEQ ID NO:94 


TCP1.9 


TGACCCCAGGAGTGGCACG 


SEQ ID NO:95 


TCPl.lO 


TCAAGCTGACTCGACACCG 


SEQ ID NO:96 


TCPl.l 1 


CGGCGTGACAGGGCTGC 


SEQ ID NO:97 


TCP1.12 


GCTGAAGGCTGAGTGTCC 


SEQ ID NO:98 


TCPl.l 3 


TAGTCCATGTTCACAATCG 


SEQ ID NO:99 
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The above primers were designed to hybridize to the 712562 cDNA clone 
(GenBank accession #AA281296). This sequence was generated from fragments 
generated using PGR primers complementary to plasmid backbone sequence and the 
sequences of the 712562 cDNA clone. In addition, 5 '-RACE was used to obtain 
5 clones encoding portions of the previously uncloned regions. In these experiments, 

RACE (Rapid Amplification of cDNA Ends; See e.g., M.A. Frohman, "RACE: Rapid 
Amplification of cDNA Ends," in Innis et al. (eds), PCR Protocols: A Guide to 
Methods and Applications [1990], pp. 28-38; and Frohman et al, Proc. Natl. Acad. 
Sci., 85:8998-9002 [1988]) was used to produce sufficient sequence for ready sequence 

10 analysis. Four such clones were generated are used to provide additional 5' sequence 

information (pFWRP5, 6, 19, and 20). This experiment resulted in the identification 
of an open reading frame that encodes a approximately 63 kD protein that is believed 
to encode either the entire protein of active telomerase, or an active subfragment of 
human telomerase protein. The sequence of the longest open reading frame identified 

15 in this experiment is shown in Figure 47 (SEQ ID NO: 100). The ORF begins at the 

ATG codon with the "met" indicated above. The poly A tail at the 3' end of the 
sequence is also shown. 

Figure 48 shows a tentative alignment of telomerase reverse transcriptase 
proteins from the human sequence (human Telomerase Core Protein 1, "Hs TCPl"), E. 

20 aediculatus pl23 ("Ep pl23), S. pombe tezl ("Sp Tezl"), S. cerevisiae EST2 (Sc 

Est2"), and consensus sequence. In this Figure various motifs are indicated. 

From the above, it is clear that the present invention provides nucleic acid and 
amino acid sequences, as well as other information regarding telomerase, telomerase 
protein subunits, and motifs from various organisms, in addition to methods for 
25 identification of homologous structures in other organisms in addition to those 

described herein. 

All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described 
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method and system of the invention will be apparent to those skilled in the art without 
departing from the scope and spirit of the invention. Although the invention has been 
described in connection with specific preferred embodiments, it should be understood 
that the invention as claimed should not be unduly limited to such specific 
embodiments. Indeed, various modifications of the described modes for carrying out 
the invention which are obvious to those skilled in molecular biology or related fields 
are intended to be within the scope of the following claims. 




CLAIMS 



We claim: 



A substantially purified peptide comprising the amino acid sequence 
selected from the group consisting of SEQ ID NOS:71, 73, 75, 77, 79, 82, 83, 83, 85, 



2. A purified, isolated polynucleotide sequence encoding the polypeptide of 
Claim 1. 

3. The polynucleotide sequence of Claim 2, wherein said polynucleotide 
hybridizes specifically to telomerase sequences, wherein said telomerase sequences are 

10 selected from the group consisting of human, Euplotes aediculatus, Oxytricha, 

Schizosaccharomyces, and Saccharomyces telomerase sequences 

4. The polynucleotide sequence of Claim 3, comprising the complement of 
a nucleic acid sequence selected from the group consisting of SEQ ID NOS:70, 72, 74, 
76, 78, 80, 81, and 100, and variants thereof 

15 ^ A polynucleotide sequence that hybridizes under stringent conditions to 

a nucleic acid sequence selected from the group consisting of SEQ ID NOS:66, 69, 80, 
and 81. 

6. The polynucleotide sequence of Claim 5, wherein said polynucleotide 
sequence is selected from the group consisting of SEQ ID NOS:70, 72, 74, 76, 78, 87, 
20 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 102, 103, 104, 105, 106, 107, 108, 109, 

and 110. 



5 



86, and 101. 



- 99 - 





7. The polynucleotide sequence of Claim 6, wherein said nucleotide 
sequence comprises a purified, synthetic nucleotide sequence having a length of about 
ten to fifty nucleotides. 



encoding at least a portion of human telomerase in a biological sample, comprising the 
steps of: 

a) providing: 

i) a biological sample suspected of containing nucleic acid 
corresponding to the polynucleotide sequence of SEQ ID NO: 100; 

ii) the nucleotide sequence of SEQ ID NO: 100, or a 
fragment thereof; 

b) combining said biological sample with said nucleotide under 
conditions such that a hybridization complex is formed between said nucleic 
acid and said nucleotide; and 

c) detecting said hybridization complex. 

9. The method of Claim- 8, wherein, said nucleic acid corresponding to the 
nucleotide sequence of SEQ ID NO: 100 is ribonucleic acid. 

10. The method of Claini 9, wherein said detected hybridization complex 
correlates with expression of the polynucleotide of SEQ ID NO: 100 in said biological 



11. The method of Claim 8, wherein, said nucleic acid corresponding to the 
nucleotide sequence of SEQ ID NO: 100 is deoxyribonucleic acid. 

12. The method of Claim 11, wherein said detecting of said hybridization 
complex comprises conditions that permit the detection of alterations in the nucleotide 



8. 



A method for detecting the presence of polynucleotide sequences 



20 



sample. 



25 



of SEQ ID NO: 100 in said biological sample. 
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JJ.^-. An antisense molecule comprising the nucleic acid sequence 
complementary to at least a portion of the nucleotide of SEQ ID NO .100. 

14. A pharmaceutical composition comprising the antisense molecule of 
Claim 13, and a pharmaceutically acceptable excipient. 

15. The polynucleotide sequence of Claim 4, wherein said nucleotide 
sequence is contained on a recombinant expression vector. 

16. The polynucleotide sequence of Claim 15, wherein said expression 
vector containing said nucleotide sequence is contained within a host cell. 



17. A method for producing a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 101, the method comprising the steps of: 

a) culturing the host cell of Claim 16, under conditions suitable for 
the expression of the polypeptide; and 

b) recovering the polypeptide from the host cell culture. 

IS. , A purified antibody which binds specifically to a polypeptide 
comprising at least a portion of the amino acid sequence of SEQ ID NO: 101. 

19. A pharmaceutical composition comprising the antibody of Claim 18 and 
a pharmaceutically acceptable excipient. 

20. A method for detecting the expression of human telomerase in a 
biological sample comprising the steps of: 

a) providing: 

i) a biological sample suspected of expressing human 
telomerase protein; and 

ii) the antibody of Claim 18; 
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b) combining said biological sample and said antibody under 
conditions such that an antibody:protein complex is formed; and 

c) detecting said complex wherein the presence of said complex 
correlates with the expression of said protein in said biological sample. 
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ABSTRACT 

The present invention is directed to novel telomerase nucleic acids and amino 
acids. In particular, the present invention is directed to nucleic acid and amino acid 
sequences encoding various telomerase protein subunits and motifs, including the 123 
kDa and 43 kDa telomerase protein subunits of Euplotes aediculatus, and related 
sequences from Schizosaccharomyces, Saccharomyces sequences, and human 
telomerase. The present invention is also directed to polypeptides comprising these 
telomerase protein subunits, as well as functional polypeptides and ribonucleoproteins 
that contain these subunits. 
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FIGURE 9 

1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 
5 1 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 
101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 
151 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 
201 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAGAT 
25 1 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 
301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 
35 1 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 
401 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 
45 1 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
50 1 CTCAAAAGC A GTATTTCTTT C AAGACGAAT GGAACCAAGT TAGAGCAATG 
551 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 
60 1 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 
65 1 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 
75 1 GAATGAGAAA GATCACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA 
801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
85 1 AACCAATTCT TC AAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
95 1 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG 
1 001 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG 
105 1 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 
1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 
1151 TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA 
1 20 1 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC 
1251 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 
1301 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 
1351 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 
] 40 1 ACGTCTTATG G AAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG 
145 1 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 
1 501 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA 
1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 
1601 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT 
1 65 1 ACCGAAGAAA ACTACTTTCC GTCCAATTAT G ACTTTCAAT AAGAAGATTG 
1 701 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGITATTG 
1 75 1 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 
1801 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 
1 85 1 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 
1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
195 1 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 
2001 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC 
2051 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGC ACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 
2151 AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AGCAAAGCAA 
220 1 AG AAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG 
225 1 CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG 
2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 



FIGURE 9 (cont.) 



2351 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 
240 1 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 
245 1 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 
2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 
255 1 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 
2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 
265 1 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 
2701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 
275 1 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 
2801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 
285 1 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 
2901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 
295 1 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 
3001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CA1TATCCAG 
305 1 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA 
3101 AAGTACATTT TCAACAGAGT TTGC ATGATC CTCAAGGCAA AAGAAGCAAA 
3151 GCTAAAAAGT GACC AATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA 
3201 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC 
3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 



FIGURE 10 

1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD 
51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 
lOI SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 
151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 
201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN 
251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 
301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS 
35 1 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 
401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLWS 
451 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMSIADLKKE TLAEVQEKEV 
501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTFhJTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 
601 TMDIEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSKN 
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 
701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
75 1 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
801 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 
85 1 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 
901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 
1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 




FIGURE 11 

1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 
5 1 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 
101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA 
151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 
201 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 
251 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 
301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 
351 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 
401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 
501 GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 
601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
65 1 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 
701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
75 1 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 
801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
85 1 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 
901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 
1 001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 
1 05 1 AAAGAAG AAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 
1101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 
1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 
1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 
1251 TCACATTCAT AGATCGACCT TC ATATATCC AATACGATGA TAAGGAAACA 
1 30 1 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 
1351 AAGAAATGGA GCCGAAATCT TAATCAAAAA GAATTGCGTC GATATTGCAA 
1401 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG 
1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 
1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 
1551 CGArCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 
1601 ATACAAACCT TGGTCAAAAT ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA 
1651 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 
1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGGGG 
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CATTCAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

HCNLARNRLHCLFQSCKNN- 
lEIWLEIAFIDYSKVAKTIR- 
I-f^SGSKSPSLTIPKLQKQLS- 

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

TCAAGATCA-iGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT ^ ' 



AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 
TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT ' 



^ TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 
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CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 
» » , + , » 540 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

HMRMSQRISIHQTYQRQTRY 
ICECVKGSRYIRLTKDKLAI- 
YENESKDLDTSDLPKTNSL-- 

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 

* * * * 600 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 



NARKSLI I EQQKNLLHLLFV- 

TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 
1 , » ^ ^ ^ ^ 66 

ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

YGFYYNCFRYRRCTPESCDN 
MGFITIVLG IDGELPSLETI- 
WVLLQLF • VSTVNSRVLRQL- 

TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 
1 , , ^ ^ ^ 

ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

CKSCLQLKESQFCKF'CVCH 
EKAVYNCRNRSSESSDVYAI- 
KKLFTTEGIAVLKVLMCMPL- 

TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 

YFVN-SQISYLNLMDSYRNK 
ILCINLKYLISI-WIAIETN- 
FCELI SNI LSQFNG'L 'KOT- 

CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 
1 , ^ ^ ^ 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

PNKPCKFNGIYVKSFGTNAH 
OIN"^SlMEYTLNPLGQMHT- 

k-t^qv-wni r - ilwdkctl- 
tgaatttatattggattct-aaagcatagatacacagaatgctttagagactgatttagc 

1 . , ^ gQ(- 

AC'rCAAATA~-_^CCTA^G^-ATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG 

CIYIGFLXHRYTECFRDCFS 
EFI^OS-SIDTQNALETDLA- 
N L Y ; L K A • I H R M L • R L I • L - 

TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

I ^ ^ ^ ggf, 

AATGTTGTCTA_ATGGACAA.AACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 



GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 

^ ^ ^ jg20 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

GEMKSRLKKEISKFVDSSVT 
AKCKED-RKRFQNLLILL-P- 
RNEKKTKERDFKICCFFCNR- 

GGAATTAACA.ACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 

^ ^ ^Qgg 

CCTTAATTGTTCTTCTTATA.^TCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

CrNrJKNISNEKEEELSOSCF 
ELTTR IIATKKKKSYHNPDS- 
N'-OOEY-QpKRRRAITILIL- 
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TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 

1081 + ♦ * ♦ * 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

LKISKIPGKRDTFIKIHIL- 

* RFQKFQVREIHSLKFIYYS- 
KDFKNSR-ERYIH-NSYIIV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 + * 1- ~ H. 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

FFISQLLFSFILTIFFD-LE 
FSFHSCYFLLS *QYFL ISWK- 
FHFTAVIFFYLNNIFCLAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 * + » 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

VKSIK-EKR-TEVT-LIHIH 

* KVSNKRSARLR-LSLFTFI- 
KKYQIREALDCGNLAYSHS'- 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1251 * » » . . 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

RSTFIYPIRC-GNSSHPF-K 
DRPSYIQYDDKETAVIRFKN- 
IDLHISNTMI RKQQSSVLKI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 

1321 . * * . 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

• CYED-IFRVKKWSRNLNQK 
SAMRTKFLESRNGAEILIKK- 
VLCGLNF'SQEMEPKS-SKR- 

GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

1381 * * ♦ . 1440 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 



ATCTTGATTG-.TTGAAGAGATTGACGAGGCAA.CTGCACAGAAGATCATTAAAGAAATAAA 

TAGAACTAAC?-\ACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

ILIDCRO-RGNCTEDH-RNK 
SCLIEEIDEATAQKI IKEIK- 
LDCLKRLTRQLHRRSLKK-S- 

GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 

CATTGAAAATA-ATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAACTT 



TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGGTCAAAAT 

AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

LTK-KLN-S'TIKNTNLGQN 
CRNKSCTKVRO-KIOTLVKI- 
DEIKAELKLDNKKYKPV-JSKY- 

ATTGAGG AAGG AAAAG AAG AC C AGTT AGC AAAAG AAAAAATAAGGC AAT.AAATAAAATCA 

TAACTCCTTCCTTTTCTTCTGCTCAATCGTTTTCTTTTTTATTCCGTTATTTATTTTACT 

lEEGKEOQLAKEKIRO-IXC 
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GTACAGAAGTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 

VQKCRNKRFIFFNNLLKRGV 
YRSEEIKDLFFSriYCKEGF 
TEVKK-KIYFFQ-FIEKRGF 

TTGGGGTTTTGGGGTTTTGGGG 

AACCCCAAAACCCCAAAACCCC 



FIGURE 13 



2 EVDVDNQADNHGIHSALKTCEEIKEAKTLYSWIQKVIRCRNQSQSHYKDL 51 
|:::: |.:-..| I L-l • I l-.l-l-l 

19 ELELEMQENQNDIQVRVK . . . IDDPKQY . . LVNVTAACLLQEGSYYQDK 62 

52 EDIKIFAQTNIVATPRDYNEEDFKVIARKEVF. STGLMIELIDKCLVELL 100 

.. • : . |.l . 1 . .|: ...1 I ...■■].. 

63 DERRYIITKALL EVAESDPEFICQLAVYIRNELYIRTTTNYIVAF 107 

101 SSSDVSDRQKLQCFGFQLKGNQLAKTHLLTALSTQKQYFFQDEWNQVRAM 150 

: -h : h-ll - 

108 . . . CWHKNTQPFIEKYFNKAVLLPNDLLEVCEFAQVLYI 144 

151 IGNELFRHLYTKYLIFQRTSEGTLVQFCGNNVFDHLKVNDKFDKKQKGGA 200 

145 FDATEFKNLY . .. . LDRILSQDIRKELTFRKCLQRCVRSKF 181 

201 ADMNE. . .PRCCSTCKYNVKNEKDHFLNNINVPNWNNMKSRTRIFYCTHF 247 

:ll ■ -I I ■■ i I -h- I-: 

182 SEFNEYQLGKYCTES. . QRKKTMFRYLSVTNKQKWDQTKKK 220 

248 NRNNQFFKKHEFVSNKNNISAMDRAQTIFTNIFRFNRIRKKLKDKVIEKI 297 

1-:. :.I : - I : |: : ] H- | || 

221 RKENLLTKLQAIKESEDKSKRETG . . DIMNVEDAIKALKPAVMKKI 2 64 

298 AYMLEKVKDFNFNYYLTKSCPLPENWRERKQKIENLINKTREEKSKYYEE 347 

I . . .1 . I ..: |.l |.l 

265 AKRQNAMK. . .. KHMKAPKI PNSTLESKYLTFKD 294 

3 48 LFSYTTDNKCVTQFINEFFYNILPKDFLTGRNRKNFQKKVKKYVELNKHE 39 7 

I- - - • I -i-ll 1 -I I 

295 LIKFCHISEP KERVYKILGKKYPKTEEEYKAAFGDSASAPFN PE 338 

398 LIHKNLLLEKINTREISWMQVETSAKHFYYFDHENIYVLWKLLRWIFEDL 447 

339 LAGKRMKIEISKTWENELSAKGNTAEVWDNLISSNQLPYMAMLRNLSN 386 

448 WSLIRCFFYVTEQQKSYSKTYYYRKNIWDVIMKMSIADLKKETLAEVQE 497 
l-l - I 

387 ILKAGVSD. 394 

4 9 8 KEVEEWKKSLG.-APGKLRLIPKKTTFRPIMTFNKKIVNSDRKTTKLTTNT 547 

I I 

39 5 . TTHS 398 

54 8 KLLNSHLMLKTL^JJRMFKDPFGFAVFNYDDVMKKYEEFVCKWKQVGQPKL 597 
I I M I 

399 tVINK ICEPKAVENSKM 4 15 

598 FFATr<DIEKCrDS\/NREKLSTFLKTT:<LLSSDFWIMTAQILKRKNNIv:D 547 

I - 11 I ■ I I I I I I 

416 F, .PLQFFSAIE.JvVN.EAVTKGFKAKK. , RENMNLKGQI EAVKE . - WE 457 

548 SKNrRKKEMKDVFRQKFQKIALEGGQYPTLFSVLENEQNDLNAKKTLIVE 697 

. : I I : I I I - - : - : - . I I ■ ■ . : ■ . ■ 1 - 

458 KTDEEKKDM ELEQTEEGEFVKVNEG IGKQ Y INS I ELA I X 496 

698 AKQRNYFKKDNLL-QPVINICQYNYINFNGKFYKQTKGIPQGLCVSSILSS 747 

. . : . ^ . : : : . . . . : 1 . : : . . | . . : | : 

4 97 lAVNKNLDEIKGHTAIFSDVSGSMSTSMSGGAKKYGSVRTCLECALVLGL 546 

748 FYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLITTQENNAVLF: 797 

: . ::^..| ::: : 

547 MVKQRCEKSSFYIFSSPSSQCNKCYLEVDL 576 

79 8 EKLI^A/SRENGFKF^mKK.LQTSFPLSPSKFAKYGMDSVEEQNIVQDYCD 846 

I I II-- -h-: ■-■■■■■■\ I 

577 PGDELRPSMQKLLQEKGKLGGG. .TDFPYECIDEWTKNKTHVD 617 

847 WIGISIDMKTLALMPNINLRIEGILCTLNLNMQTKKASMWLKKXLKSFLy. 896 

I . 1 I I . . . . : I 1 : I . : I : . : I M - : 

618 NIVILSDMMIAEGYSDINVRGSSIVNSI KKYKDEW 653 

897 tWITHYFRKTITTEDFANKTLNKLFISGGYKYMQCAKEYKD.HFKKNL/^r: 945 

II.: .:.]::: | : : . : . | : . : : : | : : i 

654 PNIKIF. .AVDLEGYG KCLNLGDEFNEfWY I K I FGX 687 

946 SSMIDLEVSKirYSVTRAFFKYLVCNIKOTIFGEEHVPOFFLSTLKHFIE 995 

I- I :|- -. 

688 SDSI EKFISAKQGGA NHVE 706 

996 IFSTKKYIFNRVC 1008 



FIGURE 14 



132 LSTQKQYFFQDEWNQVRAMIGNEL FRHLYTKYLIFQRTSE .GTLVQFC 178 
:| ..i -I llil - III 

1 MSRRNQ . .. KKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQOQI 13 

179 GNNVFDHLKVNDKFDKKOKGGAADMNEPRCCSTCKYNVKNEKDHFLNNIN 228 
:• : II - 111 I |. .. I- 1 

44 KEEDLKLLKFKNQDQDGNSGNDDDDEE NNSNKQQELLRRVN 84 

2 29 VPNWNNMKSRTRIFYCTHFNRNNQFFKKHEFVSNKNNISAMDRAQTIFTN 278 
I il I I - ■ 1 

85 QIKQQVQLIKK VGSKVEKDLNLNEDENKKN 114 

279 IFRFNRIRKKLKDKVIEKIAYMLEKVXDFNFNYYLTKSCPLPENWRERKQ 328 

i -I ■ I |.. - 11 

115 GLSEQQVKEEQLRTITEEQVKYQNLVFNMDYOLDLNESGGHRRHRRETDY 164 

329 KIENLINKTREEKSKYYEELFSYTTDNKCVTQFINE.FFYNILPKDFLTG 3 77 

..|.--: . :.i 1-1- I . II : 

165 DTEKWFEISHDQK. NYVSI YANQKTSYCWWLKDYFNK 200 

378 RNRKNFQKKVKKYVELNKHELIHKNLLLEKINTREISWMQVETSAKHFYY 427 

201 NNYDHLNVSIiTOLE. TEAEFYAFDDFSQTIKLTNNSYQTVNID . 242 

4 28 FDHENIYVLWKLLRWI FEDLWSL IRCFFYVTEQQKSYSKTYYYRKNI 475 

■I : llh I ■ II I .1 ...I. I 

243 VNFDNNLCILALLRFLLSLERFNILNIRSSY. . TRNQYNFEKIGELLETI 290 

EVQEKEVEEWKKSLGFAPGKLRL] 



525 IMTFNKKIVNSDRKTTKLTTNTKLLNSHLMLKTLKNRMFKDPFGFAVFNY 575 

: I • i I I l-l .1 : .. : . II I 

331 VYSFSTDLKLVD. TNKVQDYFKFLQEFPRLTHVSQQAI PVSATNAVENL 378 

576 DDVMKKYEEFVCKWKQVGQPKLF FATMDIEKCYDS .VNREK 615 

II -III .. -I I 

3 79 NVLLKKVKH ANLNLVS I PTQFNFDFYFVNLQHLKLEFGLEPNILTKQK 4 26 

516 LSTFL <TTKLLSSDrWIMTAOILKRKNNI VIDSICNFRKKEMK 657 

II I I I I I I I I II 

427 LENLLLSIKOSKNLKFLRLNFYTYVAQETSRKOILKQATTIKNLKNNKNO 476 

538 DYF.RQKFOKIAi^EGGOYPTLFSVLEN EQNDLNAKKTLI VEAKQRNYFK 705 
: I I ! I III 

4 77 EET?ETKDET?SESTSG.XKFFDHLSE--TELEDFSVN LQATQEIY 520 

706 KDNtL^VIN'ICQrNYINrNGKFYKOTKGIPQGLCVSSILSSFYYATLEE 755 

M " I "I I Mi 

52 1 . DSLHKLLIRSTNLKKFKLSYKYEMEKSKMDTFIDLKNI YETLNN 564 

756 SSLGFLRDES^^NPENP^rWLLMRLTDDYLLITTOE^INAVLFIEKLINVSR 305 

h i I - I I : i : 1 I : II- 

565 LKRCS\'NISNPHGNISYELTN KDSTF YKFKLTLNQE 600 

806 ENGFKFNMKKLQTSFPLSPSKFAKYGMDSVEEQNIVQDYCDWIGISIDMK 855 

- |:.:| l-.i.:. | i I - : ■ h 1 • ^ ■ : ■ • 1 • - h ^ - ■ 

601 LQHAKYTFK. . ONEFOFlsTNVKSAKI ESSSLES LEDI DSLCKS lASCKNLQ 648 

856 TLALMPNINLRIEGILCTLNLNMQT. . KKASMWLKK. .KLKSFLfOWITH 901 

1 . : : I : . i ■ - ■ I ■ ■ : : : I - ■ I 1 • = ■ I • 

649 NVNI lASLLYPNNIQKNPFNKPNLLFFKQFEQLKNLENVSINC 691 

902 YFRKTI . . .TTEDFANKTLNKLFISGGYKYMQCAKEYKDHFKKNLAMSSM 948 

.: . I ..| . ... Ml M:|: :|.M|. 

592 ILDQHILNSISEFLEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPEL 741 

949 rDLEVSKIIYS\T RAFFKYLVCNIKDT . . IFGEEHY 982 

:: :.. .:| Uh- I- II- 

742 NQVYINQQLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDF 791 

983 PDFFLS . .TLKHF lEIFSTKKY . IFNRVCMILKAKEAKLKSDOCQSLIQ 1028 

.: :{ .:1.::MI il ■ \ ■ ■ i-.::.!-!:- 

792 DONTVSDDSIKKILESISESKYHHYLSLNPSQSSSLIKSENEEIQELLK 840 



FIGURE 15 



4 OIDLDDIENLLPNTFNKYSSSCSDKKGCKTLKSGSKSPSLTIPK 47 
II - I I.. -ll-l -11 .1- 

617 NVKSAKIESSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPNNIQKNP 666 

48 LQKQLEFYFSDANLYNDSFLRKLVLKSGEQRVE lETLLM 86 

III I. - .1 II - :| 1..::: 

o67 FNKPNLLFFKQFEQLKNLENVSINCILDQHILNSISEFLEKNKKIKAFIL 716 



FIGURE 16 



MEMDIDLDDIENL LPNTFNKYSSSCSDKKGCKTLKSGSKSPS . . - 

11 II I - 1 II-. . 1 II 

I ELAI K I AVNKNLDEI KGHTAI FSDVSGSMSTSMSGGAKKYGSVRTCLEC 
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telomerase p43 LQ] 
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FIGURE 19 



1 aactcattta attactaatt taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
6 1 tagatttaat ttagaaagta tcaattgaaa aatggaaatt gaaaacaact aagcacaata 
121 gccaaaagcc gaaaaattgt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
181 tatataagtt agggttaaga ttgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
24 1 atgtttgttg taggaaggta gttactacta agataaagat gaaagaagat atatcatcac 
301 taaagcactt cttgaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
361 catccgtaat gaactttaca tcagaactac cactaactac attgtagcat tttgtgttgt 
42 1 ccacaagaat actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacttactg gaagtctgtg aatttgcata ggttctctat atttttgatg caactgaatt 
541 caaaaatttg tatcttgata ggatactttc ataagatatt cgtaaggaac tcactttccg 
601 taagtgttta caaagatgcg tcagaagcaa gttttctgaa ttcaacgaat actaacttgg 
661 taagtattgc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa 
72 1 caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcttaa ccaaacttta 
781 ggcaataaag gaatctgaag ataagtccaa gagagaaact ggagacataa tgaacgttga 
84 1 agatgcaatc aaggctttaa aaccagcagt tatgaagaaa atagccaaga gatagaatgc 
901 catgaagaaa cacatgaagg cacctaaaat tcctaactct accttggaat caaagtactt 
96 1 gaccttcaag gatctcatta agttctgcca tatttctgag cctaaagaaa gagtctataa 
1 02 1 gatccttggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtgattc 
1081 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
1141 aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat 
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ttgtctaaca tcttaaaagc 
1261 cggtgtttca gatactacac actctattgt gatcaacaag atttgtgagc ccaaggccgt 
1321 tgagaactcc aagatgttcc ctcttcaatt ctttagtgcc attgaagctg ttaatgaagc 
1381 agttactaag ggattcaagg ccaagaagag agaaaatatg aatcttaaag gtcaaatcga 
1441 agcagtaaag gaagttgttg aaaaaaccga tgaagagaag aaagatatgg agttggagta 
1501 aaccgaagaa ggagaatttg ttaaagtcaa cgaaggaatt ggcaagcaat acattaactc 
1561 cattgaactt gcaatcaaga tagcagttaa caagaattta gatgaaatca aaggacacac 
1621 tgcaaccttc tctgatgttt ctggttctat gagtacctca atgtcaggtg gagccaagaa 
1681 gtatggttcc gttcgtactt gtctcgagtg tgcattagtc cttggtttga tggtaaaata 
1 74 1 acgttgtgaa aagtcctcat tctacatctt cagttcacct agttctcaat gcaataagtg 
1 80 1 ttacttagaa gttgatctcc Gtggagacga actccgtcct tctatgtaaa aacttttgca 
1 86 1 agagaaagga aaacttggtg gtggtaciga mcccctat gagtgcattg atgaatggac 
1 92 1 aaagaataaa actcacgtag acaatatcgt tattttgtct gatatgatga ttgcagaagg 
1981 atattcagat atcaatgtta gaggcagttc cattgttaac agcatcaaaa agtacaagga 
204 1 tgaagtaaat cctaacatta aaatctttgc agttgactta gaaggttacg gaaagtgcct 
2101 taatctaggt gatgagttca atgaaaacaa ctacatcaag atattcggta tgagcgattc 
2161 aatcttaaag ttcatttcag ccaagcaagg aggagcaaat atggtcgaag ttatcaaaaa 
222 1 ctttgccctt caaaaaatag gacaaaagtg agtttcttga gattcttcta taacaaaaat 
228 1 ctcaccccac ttttttgttt tattgcatag ccattatgaa atttaaatta ttatctattt 
2341 atttaagtta cttacatagt ttatgtatcg cagtctatta gcctattcaa atgattctgc 
2401 aaagaacaaa aaagattaaa a 



FIGURE 20 

MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 

VNVTAACLLQEGSYYQDKDERRYIITKALLEVAESDPEFICQLAVYIRNELYIRTTTO 

YIVAFCWHKNTQPFIEKYFNKAVLLPNDLLEVCEFAQVLYIFDATEFKNLYLDRILS 

QDIRKELTFRJCCLQRCVRSKFSEFNEYQLGKYCTESQRKKTMFRYLSVTNKQKWDQTK 

KKRKENLLTKLQAIKESEDKSKJIETGDIMNVEDAIKALKPAVMKKIAK^ 

APKIPNSTLESKYLTFKDLIKFCHISEPKERVYKILGKKYPKTEEEYKAAFGDSASAP 

FNPELAGKRMKIEISKTWENELSAKGNTAEVWDNLISSNQLPYMAMLRNLSNILKAGV 

SDTTHSIVINKJCEPKAVENSmFPLQFFSAIEAVNEAVTKGFKAKKRENMNLKGQIE 

AVKEWEKTDEEKKDMELEQTEEGEFVKVNEGIGKQYINSIELAIKIAVNKNLDEIKG 

HTAIFSDVSGSMSTSMSGGAKKYGSVRTCLECALVLGLMVKQRCEKSSFYIFSSPSSQ 

CNKCYLEVDLPGDELRPSMQKLLQEKGKLGGGTDFPYECIDEWTKNK-raVDNIVILSD 

MMIAEGYSDINVRGSSIVNSIKKYKDEVNPNIKIFAVDLEGYGKCLNLGDEFNENNYI 

KIFGMSDSILKFISAKQGGANMVEVIKNFALQKIGQK 




FIGURE 21 

1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
6 1 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 
121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 
1 8 1 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
24 1 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 
42 1 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 
481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 
54 1 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 
66 1 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 
72 1 ttcacaaaca atcaaactta ctaataattc ttactagact gttaacatag acgttaattt 
781 tgataataat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 
841 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 
901 gctacttgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 
961 acaagttcct tgcgaagcgt tctaatattt agttaactcc tcatcataaa ttagcgttaa 
1021 agatagctaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 
1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 
1 141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 
1 20 1 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 
1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 
1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
1 44 1 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 
1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 
1621 acttttgatt agatcaacaa atttaaagaa gttcaaatta agttacaaat atgaaatgga 
1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
1 74 1 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 
1801 taaagattct actttttata aatttaagct gaccttaaac taagaattat aacacgctaa 
1 86 1 gtatactttt aagtagaacg aattttaatt taataacgtt aaaagtgcaa aaattgaatc 
1 92 1 ttccfeatta gaaagcttag -aagatanga tagtctttgc aaatctattg cttcttgtaa 
1 98 1 aaatttacaa aatgttaata ttatcgccag tttgctctat cccaacaata tttagaaaaa 
204 1 tcctttcaat aagcccaatc ttctattttt caagcaattt gaataattga aaaatttgga 
2101 aaatgtatct atcaactgta ttcttgatca gcatatactt aattctattt cagaattctt 
2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 
222 1 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta 
2281 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 
2341 ccacaagcaa aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac 
240 1 cctttagcta atagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 
2461 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 
2521 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgtttta gtaaaagcat actataaatt ccctctatgt ttaccaactg gtacttatta 
2641 cgattacaat tcagatagat ggtgattaat taaatattag tttaaataaa tattaaatat 
2701 tgaatatttc tttgcttatt atttgaataa tacatacaat agtcattttt agtgttttga 
2761 atatatttta gttatttaat tcattatttt aagtaaataa ttatttttca atcatttttt 
2821 aaaaaatcg 




FIGURE 22 

MSRRNQKKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK 

EEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRRVNQIKQQVQLIKKVGSKVEK 

DLNLNEDENKKNGLSEQQVKEEQLRTITEEQVKYQNLVFNMDYQLDLNESGGHRRHRR 

ETDYDTEKWFEISHDQKNYVSIYANQKTSYCWWLKXtYFNKNNYDHLNVSINRLETEAE 

FYAFDDFSQTIKLTNNSYQTVNIDVNFDNNLCILALLRFLLSLERFNILNIRSSYTRN 

QYNFEKIGELLETIFAVVFSHRHLQGIHLQVPCEAFQYLVNSSSQISVKDSQLQVYSF 

STDLKLVDTOKVQDYFKPLQEFPRLTHVSQQAIPVSATNAVENLNVLLKKVKHANLNL 

VSIPTQFNFDFYFVNLQHLKLEFGLEPNILTKQKLENLLLSIKQSKNLKFLRLNFYTY 

VAQETSRKQILKQATTIKNLKNNKNQEETPETKDETPSESTSGMKFFDHLSELTELED 

FSVNLQATQEIYDSLHKXLIRSTNLKKFKLSYKYEMEKSKMDTFIDLKNIYETLNNLK 

RCSVNISNPHGNISYELTNKDSTFYKFKJLTLNQELQHAKYTFKQNEFQFNNVKSAKIE 

SSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPNNIQKNPFNKPNLLFFKQFEQLK 

NLENVSINCILDQHILNSISEFLEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPE 

LNQVYINQQLEEL-WSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDFDQNTVSD 

DSIKKJLESISESKYHHYLRJLNPSQSSSLIKSENEEIQELLKACDEKGVLVKAYYKFP 

LC LPTGTY YD YN S D R W 




FIGURE 23 

MKILFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 

PNSRKJALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKIARNEDVKNSLFCHS 

ANVNVTLLKGAAWKMFHSLVGTYAFVDLLINYTVIQFNGQFFTQIVGNRCNEPHLPPK 

WVQRSSSSSATAAQIKQLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSSIKKLTDLR 

EAIFPTNLVKIPQRLKVRINLTLQKLLKRHKRLNYVSILNSICPPLEGTVLDLSHLSR 

QSPKERVLI<a^IIVILQKLLPQEMFGSKJOvJKGKIIK]^NLLLSLPLNGYLPFDSLLKKLL 

RLKDFRWLFISDIWFTKHNFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTI 

VYFRHDTWT^ITPFIVEYFKTYLVENNVCRNHNSYTLSNFNHSKMRIIPKKSNNEFR 

IIAIPCRGADEEEFTIYKENHKNAIQPTQKILEYLRNKRPTSFTKIYSPTQIADRIKE 

FKQRLLKKFNNVLPELYFMKFDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFN 

TOTGVLKLFNVVNASRVPKPYELYIDNVRTVHLSNQDVINVVEMEIFKTALWVEDKCY 

IREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPSQDTLILKLADDFLIISTDQQQV 

INIKKLAMGGFQKYNAKANRDKJLAVSSQSDDDTVIQFCAMHIFVKELEVWKHSSTMN 

NFfflRSKSSKGIFRSLIALFNTRJSYKTIDTNLNSTNTVLMQIDHVVKNISECYKSAF 

KDLSINVTQNMQFHSFLQRIIEMTVSGCPITKCDPLIEYEVRFTILNGFLESLSSNTS 

KFKDNIILLRKEIQHLQAYIYIYIHIVN 
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Oxytricha LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 
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Motif 0 

AKPLHWLMSVYWELLRSFFYVTETTFQKNR 
ISEIEWLVLGKRSNAXMCLSDFEKRKQIFAEF lYWLYNSFl IPILQSFFYITB6SDLRNR 
PST2 LKDFRWLFISD— IWFTKHKFEMLNQLAICFISHLFRQLIPKIIQTPFYCTEISSTVT- 

pl23 TREISWMQVET-SAKHFYYFDHEN-IYVLWKLLRWIFEDLWSLIRCFFYVTEQQKSYSK 



Motif 1 

LFFYRKSVWSKLQSIGIRQHLKRVQLRDVSEAEVRQHREARPALLTSRLRFIPKP— DOL 
tez 1 TVYFRKDIWKLLCRPFI-TSMKMEAFEKIHENNVRMDTQK-TTLPPAVIRLLPKK— KTF 

EST2 IVYFRHDTWNKLITPFIVEYFKTYLVEKNVCRNHNSYTLS-'HFNHSKMRIIPKKSNNEF 
pl23 TYYYRKNIWDVIMKMSI-ADLKKETLABVQBKEVEEMKKS-LOFAPGKLRLIPKK— TTF 



human 
tezl 



Motif 2 

human RplVKMDYWGARTFRREKRAERLTSRVKALP-SVLNYERA 

tezl RLITN-LRKRFLIKMGSNKKMLVSTNQTLRPVASILKHLIHEESSGIPFHLEVYMKLLTF 
EST2 RIIAIPCRGADEEEFTIYKENHKMAIQPTQKILEYLRNKRPTSFTKIYSPTCIADRIKEF 
pi 2 3 RpiMTFNKKIVNSDRKTTKLTTNTKLLHSHLMLKTLKN-RMFKDPPGPAVFNYDDVMKKY 

Motif 3 (A> 

tezl KKDLLKHRMFGR-KKYFVRIDIKSCYDRIKQDLMFRIVKK-KLKDPEPVIRKYATIHATS 
E8T2 KQRLLKKFKNVLPELYFMKFDVKSCYDSIPRMECMRILKD-ALKNENGFFVRSQYFPNTN 
pi 2 3 EBFVCKWKQVG^PKLFPATMDIEKCYDSVNREKLSTFLKTTKLLSSDFWIMTAQILKRKH 




FIGURE 26 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 
TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 
AT C AC C AAAGGAACGAGT CTTG AAATTTAT C ATTGTT ATTTT ACAGAAGTT ATTAC C C CA 
AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 
AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 
GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 
GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 
ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 
TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 
CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 
AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 
AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 
CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 
TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 
TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 
CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 
TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 
TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 
TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 
GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 
TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 
CAGT CCTAG C CAGGAC ACATTAATTTT AAAACTGGCTGACGATTTCCTTATAATAT CAAC 
AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 
TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 
TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 
CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 
AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACAC7VAATTTAAATTCAACAAA 
CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 
TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 
CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 
TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 
ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 



FIGURE 27 



AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR 
VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKR 
AERLTSRVKALFSVLNYERA 



FIGURE 28 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 
TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 
GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAA 
GAGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGA 
AGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACG 
GGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCG 
CAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGC 
GTGCTCAACTACGAGCGGGCGCG 



FIGURE 29 



MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTVVGF 

DSKPDEGVQFSSPKCSQSELIANVVKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLV 

STFPNYLISILESKNWQLLLEIIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSKKRK 

RTIETSITQNKSARKEVSWNSISISRFSIFYRSSYKKFKQDLYFNLHSICDRNTVHMWLQWIFPRQFG 

LINAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHRISLSKVYNHYCPYIDTHDDEKILS 

YSLKPNQVFAFLRSILVRVFPKLIWGNQRJFEIILKDLETFLKLSRYESFSLHYLMSNIKISEIEWLVL 

GKRSNAKMCLSDFEKRKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCRPFIT 

SMKMEAFEKINENNVRMDTQKTTLPPAVIRLLPKKNTFRLITNLRKRFLIKMGSNKKMLVSTNQT 

LRPVASILKHLINEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLMFR 

IVKKKLKJDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 

WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 

LRVVDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENSNGIINNTFFNESKKRMPFFG 

FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCNI 

YRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 

LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRIAD 
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ggtaccgatttactttcctttcttcataagctaattgcttcctcgaacgctcctaaatctctggaaatatttttacaagaactcaataacaataccaagtcaaattccaatatgaagg 
tgttattagtgatcgataatatttctattttatcggtcgttaccaagtataaggacaaaaagaacaacttccttccccctaaagacttttactttattaatttacttttcaaatatatttcg 
ggttcgcttacttttaatcgtggtactgttttagctgctacttctagccaaccgcgtgtttctaccccgtcattggatatagctcttggagtagctcacagaaatccttacaaatctt 
ctgatgagactatattagancattacagtccgtgcatattcttaacatggagccttacactttagatgagtcacgtcgcatgatggagtatttggtatcatccaacgtttgccttg 
aaaaggngataattatttgcaaaatcatgtccnagtggtggtaatccgcgaaagttttttgatgcttgcacacgtctagcatgattgagatancaaaaatttctatccactacaa 
ctcctttaacgcggttttatttttctattttctattctcatgttgttccaaatatgtatcatctcgtattaggcttttttccgttttactcctggaatcgtacctttttcactattccccctaatg 
aataatctaaattagtncgcttataattgatagtagtagaaagattggtgattctactcgtgtaatgttattagtttaaagatactttgcaaaacatttattagctatcattatataaaa 
aaaatcctataattataaatattaatcaatatttgcggtcactatttatttaaaacgttatgatcagtaggacactttgcatatatatagttatgcttaatggttacttgtaacttgcAT 
GACCGAACACCATACCCCCAAAAGCAGGATTCTTCGCTTTCTAGAGAATCAATATGTATACCTATGTA 

ccttaaatgattatgtacaactt6ttttgagagggtcgccggcaagctcgtatagcaatatatgcgaa 

CGCTTGAGAAGCGATGTACAAACGTCCTTTTCTATTTTTCTTCATTCGACTGTAGTCGGCTTCGACAGT 

AAGCCAGATGAAGGTGTTCAATTTTCTTCTCCAAAATGCTCACAGTCAGAGgtatatatatttttgttttgatttttttctattcg 

ggatagctaatatatgggcagCTAATAGCGAATGTTGTAAAACAGATGTTCGATGAAAGTTTTGAGCGTCGAAGGA 

ATCTACTGATGAAAGGGTTTTCCATGgtaaggtattctaattgtgaaatatttacctgcaattactgtttcaaagagattgtatttaaccgataaagAA 

TCATGAAGATTTTCGAGCCATGCATGTAAACGGAGTACAAAATGATCTCGTTTCTACTTTTCCTAATTA 

CCTTATATCTATACTTGAGTCAAAAAATTGGCAACTTTTGTTAGAAATgtaaataccggttaagatgttgcgcactttgaaca 

agactgacaagtatagTATCGGCAGTGATGCCATGCATTACTTATTATCCAAAGGAAGTATTTTTGAGGCTCTTC 

CAAATGACAATTACCTTCAGATTTCTGGCATACCACTTTTTAAAAATAATGTGTTTGAGGAAACTGTGT 

CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGCGCCCGCAAAGAAGTTTC 

CTGGAATAGCATTTCAATTAGTAGGTTTAGCATTTTTTACAGGTCATCCTATAAGAAGTTTAAGCAAGgt 

aactaatactgnatccttcataactaatmagATCTATATTTTAACTTACACTCTATTTGTGATCGGAACACAGTACACATG 

TGGCTTCAATGGATTTTTCCAAGGCAATTTGGACTTATAAACGCATTTCAAGTGAAGCAATTGCACAA 

AGTGATTCCACTGGTATCACAGAGTACAGTTGTGCCCAAACGTCTCCTAAAGGTATACCCTTTAATTGA 

ACAAACAGCAAAGCGACTCCATCGTATTTCTCTATCAAAAGTTTACAACCATTATTGCCCATATATTGA 

CACCCACGATGATGAAAAAATCCTTAGTTATTCCTTAAAGCCGAACCAGGTGTTTGCGTTTCTTCGATC 

CATTCTTGTTCGAGTGTTTCCTAAATTAATCTGGGGTAACCAAAGGATATTTGAGATAATATTAAAAGg 

tattgtataaaamattaccactaacgattttaccagACCTCGAAACTTTCTTGAAATTATCGAGATACGAGTCTTTTAGTTTAC 

ATTATTTAATGAGTAACATAAAGgtaatatgccaaatttttttaccattaattaacaatcagATTTCAGAAATTGAATGGCTAGT 

CCTTGGAAAAAGGTCAAATGCGAAAATGTGCTTAAGTGATTTTGAGAAACGCAAGCAAATATTTGCGG 

AATTCATCTACTGGCTATACAATTCGTTTATAATACCTATTTTACAATCTTTTTTTTATATCACTGAATC 

AAGTGATTTACGAAATCGAACTGTTTATTTTAGAAAAGATATTTGGAAACTCTTGTGCCGACCCTTTAT 

TACATCAATGAAAATGGAAGCGTTTGAAAAAATAAACGAGgtattttaaagtattttttgcaaaaagctaatattttcagAACAA 

TGTTAGGATGGATACTCAGAAAACTACTTTGCCTCCAGCAGTTATTCGTCTATTACCTAAGAAGAATAC 

CTTTCGTCTCATTACGAATTTAAGAAAAAGATTCTTAATAAAGgtattaatttttggtcatcaatgtactttacttctaatctattattag 

cagATGGGTTCAAACAAAAAAATGTTAGTCAGTACGAACCAAACTTTACGACCTGTGGCATCGATACTG 

AAACATTTAATCAATGAAGAAAGTAGTGGTATTCCATTTAACTTGGAGGTTTACATGAAGCTTCTTACT 

TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTGGgtaattatataatgcgcgattcctcattattaattttgcagGCGTAAGAAG 

TATTTTGTACGGATAGATATAAAATCCTGTTATGATCGAATAAAGCAAGATTTGATGTTTCGGATTGTT 

AAAAAGAAACTCAAGGATCCCGAATTTGTAATTCGAAAGTATGCAACCATACATGCAACAAGTGACCG 

AGCTACAAAAAACTTTGTTAGTGAGGCGTTTTCCTATTgtaagtttattttttcattggaattttttaacaaancttttttagTTGATAT 

GGTGCCTTTTGAAAAAGTCGTGCAGTTACTTTCTATGAAAACATCAGATACTTTGTTTGTTGATTTTGT 

GGATTATTGGACCAAAAGTTCTTCTGAAATTTTTAAAATGCTCAAGGAACATCTCTCTGGACACATTGT 

TAAGgtataccaattgttgaattgtaataacactaatgaaactagATAGGAAATTCTCAATACCTTCAAAAAGTTGGTATCCCTC 

AGGGCTCAATTCTGTCATCTTTTTTGTGTCATTTCTATATGGAAGATTTGATTGATGAATACCTATCGTT 

TACGAAAAAGAAAGGATCAGTGTTGTTACGAGTAGTCGACGATTTCCTCTTTATAACAGTTAATAAAA 

AGGATGCAAAAAAATTTTTGAATTTATCTTTAAGAGgtgagttgctgtcattcctaagttctaaccgttgaagGATTTGAGAA 

ACACAATTTTTCTACGAGCCTGGAGAAAACAGTAATAAACTTTGAAAATAGTAATGGGATAATAAACA 

ATACATTGTTAGCATGTCCTAAAATTGATGAAGCCTTATTTAACTCTACATCTGTAGAGCTGACGAAAC 

ATATGGGGAAATCTTTTTTTTACAAAATTCTAAGgtatactgtgtaactgaataatagctgacaaataatcagATCGAGCCTTGC 

ATCCTTTGCACAAGTATTTATTGACATTACCCACAATTCAAAATTCAATTCTTGCTGCAATATATATAG 

GCTAGGATACTCTATGTGTATGAGAGCACAAGCATACTTAAAAAGGATGAAGGATATATTTATTCCCC 

AAAGAATGTTCATAACGGgtgagtacttatmaactagaaaagtcattaattaaccttagATCTTTTGAATGTTATTGGAAGAAAA 

ATTTGGAAAAAGTTGGCCGAAATATTAGGATATACGAGTAGGCGTTTCTTGTCCTCTGCAGAAGTCAA 
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ATGgtacgtgtcggtctcgagacttcagcaatattgacacatcagGCTTTTTTGTCTTGGAATGAGAGATGGTTTGAAACCCTCTT 

TCAAATATCATCCATGCTTCGAACAGCTAATATACCAATTTCAGTCATTGACTGATCTTATCAAGCCGC 

TAAGACCAGTTTTGCGACAGGTGTTATTTTTACATAGAAGAATAGCTGATTAAtgtcattttcaatttattatatacatcctt 

tattactggtgtcttaaacaatattattactaagtatagctgacccccaaagcaagcatactataggatttctagtaaagtaaaattaatctcgttanagttttgattgacttgtcttt 

atccttatacttttaagaaagattgacagtggttgctgactactgcccacatgcccattaaacgggagtggttaaacattaaaagtaatacatgaggctaatctcctttcatttag 

aataaggaaagtggttttctataatgaataatgcccgcactaatgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatat 

acccagtgngttgaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagccccaggttatccatggtggccg 

gccngctactgagacgaaaagaaactaaggatagtttgaatactaatagctcatttaatgtcttatataaggttttgttttttcctgacttcaattttgcatgggtgaaaagaaata 

gtgttaagccattattggattccgaaatagccaaatttcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcttcaaaaactcctcctgatttaaaggag 

gaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgcaaaaaagaaaatatcattgggagacatctcttgatgaatcagatgcgga 

gagtatctccagcggatccttgatgtcaataacttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 
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EST2 pep 

Euplotes pep 

Trans of tetrsLhymen 

Consensus 



FFYCTEISST VTIVYFRHDT m KLIT I FIVE YFK-TYLVEN 

FFYVTEQQKS YkKTYYYRKN IWDVT-MKMS :iAD LKK ETLA--EVQE 

KHKE GSQIFYYRKP IWKLVSKLTI •i.T<VRIQFSEK MKQMKNKFYQ 

FFY.TE..K. .S..YYYRK. IW...-KL ---F..K V. . 



EST2 pep 

Euplotes pep 

Trans of cetrahymen 

Consensus 



NVCRIJHNSY- TLSNFNHS 

KEVEE^TKKSh ---GFAPG 

KIQLEEENLE KVEEKLIPED SFQKYPQG 



FthllEKKfeNNE pRh 



I \IPCRG 
•ITFNKK 

ESmMTF.RK 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



ADEEBFTIYK ENHKNAIQP!5>;;QKILE^Ii ^ 
rVNSDKKTTK LTTNTXLI>l|imj>lLKT:| <ir- 

DKQKNIK LNLNQILMDS^^LVFRKt<£)- 

K. -K LN.N. .L. .'S''='QL.L. .liKN- 



f^FTSFTKIYS PTQIADRIKE 

RMFK -DPFGFAVFN 

ML-G -QKIGYSWt) 



129 
120 
130 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 



I NVL--- 
; EFVCKWKQVG Q 
ife-A QFIEKWKNKG Yi\ 



P i h FFATMI 



157 
155 
158 
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S-1; FFY VTE TIP QKN RLF FYR KSV WSK 
S-2: RQHLKRVQLRDVSEAEVRQHREA 
S-3; ART FRR EKR AER LTS RVK ALF SVL NYE 

A-1: AICF LHW LMS VYV VEL LRS FFY VTE TTF O 
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS 
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YSA^ 
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Genomic DNA Insert 



5825 bp Sequenced 
~2 kb Hind III Fragment 



RT Motifs 1 2 3(A) 4(B') 5(C) 6(D) 



<;DNA 
5-20 



7 8 9 10 11 



Hind III Xco 



Original PCR 
3' RT-PCR 



RT-PCR w/ M2-B14 
RT-PCR w/ M2-B15 Be 
RT-PCR w/ M2-B15 
RT-PCR w/ M2-B16 
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Poly 4 



a g c c teg 
cag acc aaa gga att cca taa gg 
QTKGIPQG 



4(B') 



5(c') 



D Y L L I T 

3'- ctg ctg atg gag gag tag tgg 
a a aaaaa a a 
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Motif B' (4) 

QTKGIPQG 



Motif C (5) 
DDYLLIT 
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PCR Product M2 showed Reasonable Match 
with Other Telomerase Proteins 



Ot LCVSYILSSFYYANLEENALQFIiRKESMDPEKPETNLLMRLT 

Ea_pl23 KGIPQGLCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLIT 

Sp_M2 SILSSFLCHFYMEDLIDEYLSFTKKK GSVLLRW 

Sc_pl03 DGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS -QDTLILKLADDFLIIS 



QKVGIPQG 

caa aaa gtt ggt ate cct cag gg < Actual Genomic Sequence. 

Polv 4 

t t c 

ta a g c c teg 

cag acc aaa gga att cca taa gg > 

ag acc aaa gga att cca tea ggC TCA ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG 
tc tgg ttt cct taa ggt agt ccG AGT TAA GAG AGT AGA AAA AAC ACA GTA AAG ATA TAG 

KGIPSGSILSSFLCHFYM 



GAA GAT TTG ATT GAT GAA 
CTT CTA AAC TAA CTA CTT 

E D L I D E 



TAG CTA TGG TTT ACG AAA AAG 
ATG GAT AGC AAA TGC TTT TTC 

Y L S F T K K 



AAA GGA TCA GTG TTG TTA CGA 
TTT CCT AGT GAG AAC AAT GCT 

K G S V L L R 



GTA GTC gac gac tac etc etc ate ace 
CAT CAG ctg ctg atg gag gag tag tgg 

VVDDYLLIT 

< ctg ctg atg gag gag tag tgg 

a a aaaaa a a 
t t t t 
c c 

Polv 1 



.gac gat ttc etc ttt ata aca < Actual Genomic Sequence. 

D D F L F I T 
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1 . Synthesis of cDNA with Qj Primer. 
mRNA 



3' RT PGR Strategy 



■ AAAAAAAAAAAAAAAA. . 



■ AAAAAAAAAAAAAAAA. . 



2. First Round PCR Using Outside Primer and Qq Primer. 



3. Second Round PCR Using Inside Primer and Q| Primer. 



4. Sequence Second Round PCR Products Using Inside Primer or Q\ Primer. 
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-Size Selected Libraries from P. Nurese 

3~4kb 

5 ~ 6 kb 

7 ~ 8kb 

11 - 12kb 
-Libraries from J.A. Wise 

Sou 3a Partial Digest 

Hind 111 Partial Digest 

cDNA Libraries 

GAD (Gal Activation Domain) Library 
REP Library from R. Allstiire 
REP81ES Library (old) 
REPS IBS Library (new) 
REP41ES Library 



o < e 2 © Q 

P. Nurse z 2 O z o 
o I 1 <^ £i n £t E 




- Original PGR 






3' RT-PCR 
















500 bp 



Mg Cone 
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A5 B2 




Hind 



III Digested Positive Genomic 
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5' RT PGR Strategy 



■ AAAAAAAAAAAAAAAA 



I . Synthesis of cDNA with Specific Downstream Primer. 

mRNA 



■ AAAAAAAAAAAAAAAA 



3' I 



2. Ligate Oligo with 5'-P and blocl<ed 3' to cDNA using T4 l?NA Ligase. 

^ ■,■,„,■„,„„■,„ „ ....-.^^ 



3. First Round PCR 



4. Second Round PCR 
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Alignment of RT Donnains fronn Telonnerase Catalytic Subunits. 

Motif 0 

S.p. Tezlp (429). WLYNSFIIPILQSFFYITESSDLRNRTv^i^FRKDIW ...(35)... 
S.c. Est2p (366). WirRQLIPKIIQTFFYCTEISSTVT-Iv'YFRHDTW ...(35)... 
E.a. pl23 (441). WIFEDLWSLIRCFFYVTEQQKSYSKTYYYRXNIW ...(35)... 

Motif 1 Motif 2 K 
p hh n K hR h R 

S.p. Tezlp AVIR1LPKK--KTFRLITN-LRKRF ...(61)... 
S.c. Est2p SKMRIIPKKSNNWRIIAIPCRGAD ...(62)... 
E.a. pl23 GKLR1:PKK--TTFRPIMTFNKKIV . . . (61) . . . 

Motif 3 (A) AF 

h hDh GY h 
S.p. Tezlp KKYFVRIDIKSCYDRIKQDLMFRIVK . . . (89) . . . 
S.c. Est2p EIYFMKFDVIxSCYDSIPPJ^ECMRILK ...(75)... 
E.a. pl23 KIFFATMDIEKCYDSVNREKLSTFLK . . . (107) . . . 

Motif 4 (B' ) 

hPQG pP hh h 
S.p. Tezlp YLQKVGIPQGSILSSFLCHFYMEDLIDEYLSF ...(6)... 
S.c. Est2p YIREDGLFQGSSLSAPIVDLATxTDDLLEFYSEF ...(8)... 
E.a. pl23 YKQTKGIPQGLCVSSILSSFYYATLEESSLGF ...(14)... 

Y Motif 5(C) Motif 6(D) 

h F DDhhh Gh. h cK h 

S.p. Tezlp VLLPWDDFLFITVNKJCDJ-iXKFLNLSLRGFEKHXFSTSLEKTVINFENS .(205) 
S.c. Est2p LILK1ADDFLIISTDQQQVINIKKLAMGGFQKYI\-AKA-MRDKILAVSSQS .(173) 
E.a. pl23 " LIMRLTDDYLLITTQENNAVLFIEKLINVSRENGFKFMMKKLQTSFPLS . (209) 
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Disruption strategy for the putative telomerase genes. 



wild type telomerase gene 



wild type telomerase ger^e 



Transform with linear fragmerit 
containing tl-ie telomerase gene 
disrupted witli a LEU2 or uro4 marl<er. 



telomerase gene::M 
wild type telomerase gene 




wild type telomerase gene 



Wild Type (X2) 



2. Assay in selective media. 

3. Sporulate, and grow on 
selective media. 




Senescence? (X2) 



(Diese cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 
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An Example of Confirmation of tezl disruption By PCR 




III 
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Fez 7 disruption causes progressive 
shortening of telomeres in S. pombe 



Tetrad 



heterozygote diploid 
wt diploid 
wt haploid 



. c c 

C C C O <D 

<D 0) CD D> D> 

D> O) D) o ^5 

lO O lO O CM 
CM lO 



Generations 
after sporulation 



fezIA tez1+ fezl+ tezlA 



Telomeric 

Apa I 
fragment 
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1 ggtaccgatttactttcctttcttcataagctaattgcttcctcgaacgctcctaaatctctggaaatatttttacaaga 80 
81 actcaataacaataccaagtcaaattccaatatgaaggtgttattagtgatcgataatatttctattttatcggtcgtta 160 
161 ccaagtataaggacaaaaagaacaacttccttccccctaaagacttttactttattaatttacttttcaaatatatttcg 240 
241 ggttcgcttacttttaatcgtggtactgttttagctgctacttctagccaaccgcgtgtttctaccccgtcattggatat 320 
321 agctcttggagtagctcacagaaatccttacaaatcttctgatgagactatattagattcattacagtccgtgcatattc 400 
401 ttaacatggagccttacactttagatgagtcacgtcgcatgatggagtatttggtatcatccaacgtttgccttgaaaag 480 
481 gttgataattatttgcaaaatcatgtccttagtggtggtaatccgcgaaagttttttgatgcttgcacacgtctagcatg 560 
561 attgagatattcaaaaatttctatccactacaactcctttaacgcggttttatttttctattttctattctcatgttgtt 640 
641 ccaaatatgtatcatctcgtattaggcttttttccgttttactcctggaatcgtacctttttcactattccccctaatga 720 
721 ataatctaaattagtttcgcttataattgatagtagtagaaagattggtgattctactcgtgtaatgttattagtttaaa 800 
801 gatactttgcaaaacatttattagctatcattatataaaaaaaatcctataattataaatattaatcaatatttgcggtc 880 
881 actatttatttaaaacgttatgatcagtaggacactttgcatatatatagttatgcttaatggttacttgtaacttgc 958 

959 ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 1018 
IMTEHHTPKSRILRFLENQYV 20 



1019 TAG CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT TTG AGA GGG TCG CCG GCA AGC TCG 
21YLCTLNDYVQLVLRGS PASS 

-1079 TAT AGC AAT ATA TGC GAA CGC TTG AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT 
_-41YSNICERLRSDVQTSFSIFL 

-1139 CAT TCG ACT GTA GTC GGC TTC GAG AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 
. CTVVGFDS K PDEGVQFSS P 



1078 
40 
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1199 AAA TGC TCA CAG TCA GAG gtatatatatttttgttttgatttttttctattcgggatagctaatatatgggcag 1272 

81 K C S Q S E 86 

1273 CTA ATA GCG AAT GTT GTA AAA CAG ATG TTC GAT GAA ACT TTT GAG CGT CGA AGG AAT CTA 1332 

87LIANVVKQMFDESFERRRNL 106 

1333 CTG ATG AAA GGG TTT TCC ATG gtaaggtattctaattgtgaaatatttacctgcaattactgtttcaaagaga 1405 

107 L M K G F S M 113 

1406 ttgtatttaaccgataaag AAT CAT GAA GAT TTT CGA GCC ATG CAT GTA AAC GGA GTA CAA AAT 1469 

114 NHEDFRAMHVNGVQN 128 

1470 GAT CTC GTT TCT ACT TTT CCT AAT TAG CTT ATA TCT ATA CTT GAG TCA AAA AAT TGG CAA 1529 

129 DLVSTFPNYLISILESKNWQ 148 

1530 CTT TTG TTA GAA AT gtaaataccggttaagatgttgcgcactttgaacaagactgacaagtata'g T ATC GGC 1601 

149 L L L E I I G 155 

1602 AGT GAT GCC ATG CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG GCT CTT CCA AAT GAC 1661 

^,,,,156 SDAMHYLLSKGSIFEALPND 175 

2662 AAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT AAA AAT AAT GTG TTT GAG GAA ACT GTG 1721 

.:"176 NYLQISGIPLFKNNVFEETV 195 

^4=722 TCA AAA AAA AGA AAG CGA ACC ATT GAA ACA TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA 17 81 

;-=196 SKKRKRTIETSITQNKSARK 215 

"1=7 82 GAA GTT TCC TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC TAT 1841 

s 216 EVSWNSISISRFSIFYRSSY 235 

SSA2 AAG AAG TTT AAG CAA G gtaactaatactgttatccttcataactaattttag AT CTA TAT TTT AAC 
"1907 

-236 KKFKQD LYFN245 

^^^419 08 TTA CAC TCT ATT TGT GAT CGG AAC ACA GTA CAC ATG TGG CTT CAA TGG ATT TTT CCA AGG 1967 

246 LHSICDr'nTVHMWLQWIFPR 265 

1968 CAA TTT GGA CTT ATA AAC GCA TTT CAA GTG AAG CAA TTG CAC AAA GTG ATT CCA CTG GTA 2 027 

266 QFGLIN AFQVKQLHKVIPLV 285 

202 8 TCA CAG AGT ACA GTT GTG CCC AAA CGT CTC CTA AAG GTA TAC CCT TTA ATT GAA CAA ACA 2087 

286 SQSTVVPKRLLKVYPLIEQT 305 

2 08 8 GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA AAA GTT TAC AAC CAT TAT TGC CCA TAT ATT 2147 

306 AKRLHRISLSKVYNHYCPYI 325 

2148 GAC ACC CAC GAT GAT GAA AAA ATC CTT AGT TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG 22 07 

326 DTHDDEKILSYSLKPNQVFA 345 

22 08 TTT CTT CGA TCC ATT CTT GTT CGA GTG TTT CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA 22 67 

346 FLRSILVRVFPKLIWGNQRI 365 



2268 TTT GAG ATA ATA TTA AAA G gtattgtataaaatttattaccactaacgattttaccag AC CTC GAA ACT 2336 
366 FEIILKD LET 375 
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2337 TTC TTG AAA TTA TCG AGA TAG GAG TCT TTT AGT TTA CAT TAT TTA ATG AGT AAC ATA AAG 2396 

376 FLKLSRYESFSLHYLMSNIK 395 

2397 gtaatatgccaaatttttttaccattaattaacaatcag ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA 2465 

395 ISEIEWLVLG 405 

2466 AAA AGG TCA AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT GCG 2525 

406 KRSNAKMCLSDFEKRKQIFA 425 

252 6 GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA CAA TCT TTT TTT TAT 2585 

426 EFIYWLYNSFIIPILQSFFY 445 

2 586 ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT GTT TAT TTT AGA AAA GAT ATT TGG AAA 2 645 

446 ITESSDLRNRTVYFRKDIWK 465 

2646 CTC TTG TGC CGA CCC TTT ATT ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA -ATA AAC GAG 27 05 

466 LLCRPFITSMKMEAFEKINE 485 

2706 gtattttaaagtattttttgcaaaaagctaatattttcag AAC AAT GTT AGG ATG GAT ACT CAG AAA ACT 2775 

486 N N V R M D T Q K T 495 

277 6 ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT AAG AAG AAT ACC TTT CGT CTC ATT ACG 2 83 5 

496 TLPPAVIRLLPKKNTFRLIT 515 

2836 AAT TTA AGA AAA AGA TTC TTA ATA AAG gtattaatttttggtcatcaatgtactttacttctaatctatta 2906 

516 NLRKRFLIK 524 

2907 ttagcag ATG GGT TCA AAC AAA AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTG 2967 

525 MGSNKKMLVSTNQTLRPV 542 

2 968 GCA TCG ATA CTG AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA TTT AAC TTG GAG 3027 



543 A S I L K H 



562 



302 8 GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT CTT AAG CAC CGA ATG TTT GG gtaat 3088 

563 VYMKLLTFKKDLLKHRMFG 581 

3089 tatataatgcgcgattcctcattattaattttgcag G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA 3155 

532 R K K Y F V R I D I 591 

3156 AAA TCC TGT TAT GAT CGA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG AAA CTC 3215 

592 KSCYDR IKQDLMFRIVKKKL 611 

3216 AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT GCA ACA AGT GAG CGA GCT 3275 

612KDPEFVIRKYATIHATSDRA 631 



3276 ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC TAT T gtaagtttattttttcattggaattttttaacaa 3343 

632 TKNFVSEAFSYF 643 

3344 attcttttttag TT GAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 3405 

644 DMVPFEKVVQLLSMKT 659 

3406 TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT TCT GAA ATT TTT 3465 

660 SDTLFVDFVDYWTKSSSEIF 679 

3466 AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT AAG gtataccaattgttgaattgtaataaca 3532 



680 K M L 



692 
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3533 ctaatgaaactag ATA GGA AAT TCT CAA TAG CTT CAA AAA GTT GGT ATC OCT GAG GGC TCA 3593 

593 IGNSQYLQKVGIPQGS 708 

3594 ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAG CTA TCG 3 653 

709 ILSSFLCHFYMEDLIDEYLS 728 

3 654 TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT TTC CTC TTT ATA AC A 3713 

729 FTKKKGSVLLRVVDDFLFIT 748 

3714 GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT TTA TCT TTA AGA G gtgagttgctgtcattcc 3777 

749 VNKKDAKKFLNLSLRG 764 

3778 taagttctaaccgttgaag GA TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA 3840 

765 FEKHNFSTSLEKTV 778 

3 841 ATA AAC TTT GAA AAT ACT AAT GGG ATA ATA AAC AAT ACT TTT TTT AAT GAA AGC AAG AAA 3900 

779 INFENSNGI INNTFFNESKK 798 

3 901 AGA ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT GAT ACA TTG TTA GCA TGT 3 960 
799 RMPFFGFSVNMRSLDTLLAC 818 

3961 CCT AAA ATT GAT GAA GCC TTA TTT AAC TCT ACA TCT GTA GAG CTG ACG AAA CAT ATG GGG 4020 

819 PKIDEALFNSTSVELTKHMG 838 

4021 AAA TCT TTT TTT TAC AAA ATT CTA AG gtatactgtgtaactgaataatagctgacaaataatcag A TCG 4 089 

839 KSFFYKILR S 848 

4 090 AGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA AAA TTC AAT TCT 4149 
849 SLASFAQVFIDITHNSKFNS 868 

4150 TGC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATG TGT ATG AGA GCA CAA GCA TAC TTA AAA 420 9 

869 CCNIYRLGYSMCMRAQAYLK 888 

4210 AGG ATG AAG GAT ATA TTT ATT CCC CAA AGA ATG TTC ATA ACG G gtgagtacttattttaactaga 4274 

889 RMKDIFIPQRMFITD 903 

4275 aaagtcattaattaaccttag AT CTT TTG AAT GTT ATT GGA AGA AAA ATT TGG AAA AAG TTG GCC 4339 

904 LLNVIGRKIWKKLA 917 

4340 GAA ATA TTA jGGA TAT ACG .AGT AGG CGT TTC TTG TCC TCT GCA GAA GTC AAA TG gtacgtgtc 4401 

918 EILGYTSRRFLSSAEVKW 935 

4402 ggtctcgagacttcagcaatattgacacatcag G CTT TTT TGT CTT GGA ATG AGA GAT GGT TTG AAA 4468 

936 LFCLGMRDGLK 946 

44 69 CCC TCT TTC AAA TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTG ACT 452 8 

947 PSFKYHPCFEQLIYQFQSLT 966 

4 529 GAT CTT ATC AAG CCG CTA AGA CCA GTT TTG CGA CAG GTG TTA TTT TTA CAT AGA AGA ATA 4588 

967 DL I K P LR PV L RQVLFLHRR I 986 



4589 GCT GAT TAA tgtcattttcaatttattatatacatcctttattactggtgtcttaaacaatattattactaagtata 4665 
987 A D * 989 
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4 666 gctgacccccaaagcaagcatactataggatttctagtaaagtaaaattaatctcgttattagttttgattgacttgtct 4745 
4746 ttatccttatacttttaagaaagattgacagtggttgctgactactgcccacatgcccattaaacgggagtggttaaaca 482 5 
4 82 6 ttaaaagtaatacatgaggctaatctcctttcatttagaataaggaaagtggttttctataatgaataatgcccgcacta 4905 

4906 atgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatatacccagtgtt 4 985 
4986 gttgaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagc 5065 
5066 cccaggttatccatggtggccggccttgctactgagacgaaaagaaactaaggatagtttgaatactaatagctcattta 5145 
5146 atgtcttatataaggttttgttttttcctgacttcaattttgcatgggtgaaaagaaatagtgttaagccattattggat 522 5 
522 6 tccgaaatagccaaatttcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcttcaaaaactcc 5305 
5306 tcctgatttaaaggaggaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgc 5385 

53 86 aaaaaagaaaatatcattgggagacatctcttgatgaatcagatgcggagagtatctccagcggatccttgatgtcaata 5465 

54 66 acttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 5544 





FIGURE 47 



1 



met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAG GTC GTC GAG CTG CTC 



arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser lie 
CTC TTT TTC TAG CGG AAG AGT GTC TGG AGG AAG TTG CAA AGG ATT 

40 50 
gly lie arg gin his leu lys arg val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG GAG CTG CGG GAG CTG TCG 

60 

glu ala glu val arg gin his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAG GGG CTG CGG CGG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAG TAG GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 110 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAG GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAG GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC GCG CGG CCT GAG CTG TAG 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gin 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAG GAC ACC ATC CCC CAG 



10 



20 



180 

asp arg leu thr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 
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190 



200 



thr tyr cys val arg arg tyr ala val val gin lys ala ala met 

ACG TAG TGC GTG GGT GGG TAT GCC GTG GTG GAG AAG GCC GGC ATG 

210 

gly thr ser ala arg pro ser arg ala thr ser tyr val gin cys 

GGC ACG TGC GCA AGG CCT TCA AGA GGC ACG TCC TAG GTC GAG TGC 

220 230 

gin gly lie pro gin gly ser lie leu ser thr leu leu cys ser 

GAG GGG ATG CCG CAG GGC TCC ATG GTC TCC ACG CTG CTC TGC AGC 

240 

leu cys tyr gly asp met glu asn lys leu phe ala gly lie arg 

CTG TGC TAG GGC GAG ATG GAG AAG AAG CTG TTT GGG GGG ATT" GGG 

250 260 

arg asp gly leu leu leu arg leu val asp asp phe leu leu val 

GGG GAG GGG CTG CTC CTG GGT TTG GTG GAT GAT TTC TTG TTG GTG 

270 

thr pro his leu thr his ala lys thr phe leu arg thr leu val 

ACA CCT GAG CTC AGC CAC GGG AAA ACG TTC CTC AGG ACG GTG GTC 

280 290 

arg gly val pro glu tyr gly cys val val asn leu arg lys thr 

CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAG TTG GGG AAG ACA 

300 

val val asn phe pro val glu asp glu ala leu gly gly thr ala 

GTG GTG AAG TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC ACG GGT 

310 320 

phe val gin met pro ala his gly leu phe pro trp cys gly leu 

TTT GTT GAG ATG GGG GCC CAC GGC CTA TTC GCC TGG TGC GGC CTG 

330 

leu leu asp thr arg thr leu glu val gin ser asp tyr ser ser 

CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAG TCC AGC 

340 350 

tyr ala arg thr ser ile arg ala ser leu thr phe asn arg gly 

TAT GCC CGG ACC TCC ATG AGA GCC AGT CTC ACG TTC AAG GGG GGC 

360 

phe lys ala gly arg asn met arg arg lys leu phe gly val leu 

TTC AAG GCT GGG AGG AAG ATG GGT CGG AAA CTC TTT GGG GTC TTG 

370 380 

arg leu lys cys his ser leu phe leu asp leu gin val asn ser 

CGG CTG AAG TGT GAC AGG CTG TTT GTG GAT TTG CAG GTG AAC AGG 
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390 

leu gin thr val cys thr asn ile tyr lys ile leu leu leu gin 
CTC CAG ACG GTG TGC ACC AAC ATC TAG AAG ATC CTC CTG CTG GAG 

400 410 

ala tyr arg phe his ala cys val leu gin leu pro phe his gin 
GGG TAG AGG TTT CAG GGA TGT GTG CTG CAG CTC CCA TTT CAT CAG 

420 

gin val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 

arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG .GAT 

450 

val ala gly gly gin gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CGC TCT GCC CTC CGA GGC 

460 470 

arg ala val ala val pro pro ser ile pro ala gin ala asp ser 
CGT GGA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAG TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gin asp ser 
ACA CGG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAG AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAG GCA GCT GAG TCG GAA GCT GCC GGG GAC GAG GCT GAC TGC 

510 

pro gly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 



CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 
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AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 48 

Motif -1 

Ep pl23 ...LVVSLIRCFFYVTEQQKSYSKT... 

Sp Tezl ...FIIPILQSFFYITESSDLRNRT... 

Sc Est2 ...LIPKIIQTFFYCTEISSTVTIV... 

HsTCPl ...YVVELLRSFFYVTETTFQKNRL... 

consensus FFY TE 

K 

Motif 0 phhhK hRli R 

Ep pl23 ...KSLGFAPGKLRLIPKKT-TFRPIMTFNKKIV... 

Sp Tezl ...QKTTLPPAVIRLLPKKN-TFRLITNLRKRFL... 

Sc Est2 ...TLSNFNHSKMRIIPKKSNNEFRIIAIPCRGAD 

Hs TCPl ...ARPALLTSRLRPIPKPD-GLRPIVNMDYVVG... 

consensus R PK R I 

AF 

Motif A h hDh GY h 

Ep pl23 ...PKLFFATMDIEKCYDSVNREKLSTFLK.. 

Sp Tezl ...RKKYFVRIDIKSCYDRIKQDLMFRIVK... 

Sc Est2 ...PELYFMKFDVKSCYDSIPRMECMRILK... 

Hs TCPl ...PELYFVKVDVTGAYDTIPQDRLTEVIA...//... 

consensus F D YD 

Motif B hPQG pShh 

Ep pl23 ...NGKFYKQTKGIPQGLCVSSILSSFYYA... 

Sp Tezl ...GNSQYLQKVGIPQGSILSSFLCHFYME... 

Sc Est2 ...EDKCYIREDGLFQGSSLSAPIVDLVYD. 

Hs TCPl ...RATSYVQCQGIPQGSILSTLLCSLCYG... 

consensus G QG S 

Y 

Motif C h FDDhhh 

Eppl23 ...PNVNLLMRLTDDYLLITTQENN... 

Sp Tezl ...KKGSVLLRVVDDFLFITVNKKD... 

Sc Est2 ...SQDTLILKLADDFLIISTDQQQ... 

HsTCPl ...RRDGLLLRLVDDFLLVTPHLTH... 

consensus DD L 

Motif D Ghh cK 

Eppl23 ...NVSRENGFKFNMKKL... 

SpTezl ...LNLSLRGFEKHNFST... 

Sc Est2 ...KKLAMGGFQKYNAKA... 

HsTCPl ...LRTLVRGVPEYGCVV... 

consensus G 
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